ABSTRACT

A multiplicative (loglinear) model for square tables
or other cross-classifications is helpful in locating cells where
counts are relatively dense or sparse. This model helps to eliminate
the confounding of prevalence and interaction effects. It yields a
parsimonious set of parameters which describe the table, and goodness
of fit can be assessed with standard inferential procedures. A
multiplicative specification which fits a particular
cross-classification may be obtained in several ways. The model is
illustrated using data on the occupational mobility of American men,
on the educational attainment of Wisconsin sibling pairs, and on the
occupations of male Detroit friendship-pairs. (Author/HK)
ABSTRACT

The Structure of Social Relationships: Cross-Classifications
of Mobility, Kinship, and Friendship

The paper describes a multiplicative (loglinear) model for square tables 
(or other cross-classifications) which is helpful in locating cells where 
counts are relatively dense or sparse. This specification eliminates the 
confounding of prevalence and interaction effects, which has plagued other 
schemes for interpreting such tables. The model yields a parsimonious set 
of parameters which describe the table, and goodness of fit can be assessed 
with standard inferential procedures. A multiplicative specification which
fits a particular cross-classification may be obtained in any of several ways.
For example, one may begin with a complete or partial theory about the
cross-classification, or one may begin without a theory. By examining residuals
from fit under a given model, it is possible to improve the specification
in successive rounds of estimation. The counts may be smoothed or aggregated
to minimize the chances of fitting and interpreting trivial or unreliable
fluctuations in them. Maximum-likelihood estimation is emphasized, but
diagnostic information may be obtained using computationally simpler algorithms.
The model (and associated inferential methods) can also be used in the
comparison of two or more classifications. The exposition is illustrated using
data on the occupational mobility of American men, on the educational
attainments of Wisconsin sibling-pairs, and on the occupations of male Detroit
friendship-pairs.



The general problem may be stated as follows: Having given the number
of instances respectively in which things are both thus and so, in which they
are thus but not so, in which they are so but not thus, and in which they are
neither thus nor so, it is required to eliminate the general quantitative
relativity inhering in the mere thingness of the things, and to determine the
special quantitative relativity subsisting between the thusness and the soness
of the things.

Doolittle (1888) as quoted by
Goodman and Kruskal (1959:131)



Social scientists often analyze square tables of counts, where persons,
relationships, or other subjects of interest have been classified twice
using the same set of categories. For example, in studies of social mobility
persons are often classified by their own occupation (or education or social
class) and by the occupation (or education or social class) of their fathers.
Marriages may be classified by the occupation, education, ethnicity, or
religion of each mate. Persons may be cross-classified by place of birth and
place of residence, by political party preference before and after an
electoral campaign, and so on.

While these square tables of counts may be interpreted and analyzed in
many ways, one plausible and traditional interpretation says that each
observed count has two components. First, there are effects of the prevalence
of observations within each category of the classifications taken singly.
For example, in a classification of American men by their own and by their
fathers' jobs, one expects to find many blue-collar sons with blue-collar
fathers, simply because many men work at blue-collar jobs. Second, there
are greater or lesser tendencies for categories to interact, that is, to
occur jointly. To continue the example, one expects to find many men in the
same occupations as their fathers and few in vastly dissimilar occupations.

The verbal distinction between prevalence and interaction is easy to
maintain, but for many years a sound statistical representation of it eluded
the efforts of social scientists. The history of this pursuit and the common
faults of proposed solutions have been reviewed by Hauser (1978; also, see
Featherman and Hauser 1978: Ch. 4). In the next section of this paper I
describe a class of multiplicative (loglinear) models whose parameters
correspond exactly to the intuitive concepts of prevalence and interaction
effects.

Empirically, the correspondence between parameters and concepts only becomes
useful when a model of the desired form fits the data. Since one always
assumes the existence of prevalence effects for categories of the
classificatory variables, the empirical problem is to specify the form of the
interactions. Sometimes, sociological theory will provide sufficient guidance
in model specification (Goldthorpe and Payne 1978, Hope 1980), but often
theory will provide incorrect, incomplete, or contradictory directions. For
these reasons I describe methods for assessing goodness of fit and for
improving specification through the examination of residuals. After illustrating
these ideas in an exploratory analysis of an American father-son occupational
mobility table, I show how several conventional statistical analyses of the
same table lead to misleading conclusions. Users of empirically based
search strategies run the risk of overfitting data; that is, one loses
parsimony and reliability by seeking to fit every feature of a sample of
observations. In order to minimize such misuses of empirically guided search
methods, I have elsewhere described methods of aggregating and smoothing data
prior to model selection (Hauser 1979). In the next section of the paper I
give two more empirical illustrations of the model: an analysis of similarity
in the educational attainments of Wisconsin sibling pairs and an analysis of
similarity in the occupational positions held by Detroit men and their friends.

The traditional distinction between prevalence and interaction effects is
motivated in part by an interest in comparing these components across time
and place or between segments of a society. In the last section of the paper
I use American mobility data for several cohorts to show how models of the
present form may be used to measure and to interpret differences among
populations.
A Multiplicative Model of the Mobility Table

My model is a special case of Goodman's (1972c) general multiplicative
model for cross-classified data, but I take a slightly different approach
from him in developing models of the mobility table. First, I limit
attention to the class of models in which there is only one interaction
parameter for each cell in the classification. Second, I do not assume that
the categories are ordered. Third, I emphasize the use of exploratory methods
in model specification. Elsewhere, these models and methods have been applied
in analyses of the 1949 British mobility table (Hauser 1978), of several
American mobility tables (Featherman and Hauser 1978: Ch. 4), of Rogoff's
(1953) Indianapolis mobility tables (Baron 1977, 1980), of a 1972 British
mobility table (Goldthorpe and Payne 1980), and of an American ethnic
intermarriage table (Shavit 1978).

Let $x_{ij}$ be the observed frequency in the $ij$th cell of a classification
where $i = 1, \ldots, I$ and $j = 1, \ldots, J$. In the present context the same
categories will appear in the same order in rows and columns, and the table
will be square with $I = J$. For $k = 1, \ldots, K$, let $H_k$ be a mutually
exclusive and exhaustive partition of the pairs $(i,j)$ in which

$$E[x_{ij}] = m_{ij} = \alpha \beta_i \gamma_j \delta_{ij} \tag{1}$$

where $\delta_{ij} = \delta_k$ for $(i,j) \in H_k$, subject to the normalization
$\prod_i \beta_i = \prod_j \gamma_j = \prod_i \prod_j \delta_{ij} = 1$.

The normalization of parameters is a matter of convenience, and we choose the
value of $\alpha$ so it will hold. Note that in contrast to the conventional
structural model for counted data (Bishop, Fienberg and Holland 1975: Ch. 2),
the interaction effects in equation 1 are not constrained within rows or
columns even though the marginal frequencies are fitted exactly.



The model says the expected frequencies are a product of an overall
effect ($\alpha$), a row effect ($\beta_i$), a column effect ($\gamma_j$), and an
interaction effect ($\delta_{ij}$). The row and column parameters correspond to
the concept of prevalence. For example, in an occupational mobility table
they reflect occupational supply and demand, demographic replacement
processes, and past and present technologies and economic conditions. The
cells $(i,j)$ are assigned to $K$ mutually exclusive and exhaustive subsets,
and each of those sets shares a common interaction parameter. Thus, aside
from total, row, and column effects, each expected frequency is determined by
only one interaction parameter, which reflects the density of observations in
that cell relative to that in other cells in the table. That is, the
interaction parameters of the model correspond directly to the concept of the
joint density of observations (White 1963:26), and they may be interpreted as
indexes of the social distance between categories of the row and column
classifications (compare Rogoff 1953:31-32).
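
The product form of equation 1 can be sketched in a few lines of Python. The 3 x 3 design, the parameter values, and the function name below are hypothetical and chosen only for illustration; they are not estimates from any table in this paper. The values were picked so that the products of the row effects, the column effects, and the cell-level interaction effects each equal one, in which case the overall effect is the geometric mean of the expected frequencies.

```python
import math

def expected_frequencies(alpha, beta, gamma, delta, levels):
    """Return m_ij = alpha * beta_i * gamma_j * delta_k, with k = levels[i][j]."""
    return [
        [alpha * b * g * delta[levels[i][j]] for j, g in enumerate(gamma)]
        for i, b in enumerate(beta)
    ]

# Hypothetical design: each cell is assigned to one of K = 3 levels.
levels = [[1, 2, 3],
          [2, 2, 3],
          [3, 3, 3]]

alpha = 100.0                      # overall effect
beta = [0.5, 1.0, 2.0]             # row effects, product = 1
gamma = [2.0, 1.0, 0.5]            # column effects, product = 1
delta = {1: 4.0, 2: 2.0, 3: 0.5}   # level effects; 4 * 2**3 * 0.5**5 = 1 over cells

m = expected_frequencies(alpha, beta, gamma, delta, levels)

# Under the normalization, alpha is the geometric mean of the m_ij.
logs = [math.log(value) for row in m for value in row]
geometric_mean = math.exp(sum(logs) / len(logs))
```

Because every cell carries exactly one level parameter, two cells in the same row differ only by their column effects and by the ratio of their level effects.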

While my model is a special case of Goodman's (1972c, sec. 3) general
multiplicative model, unlike several of the models which Goodman (Table V)
applied to the British (and Danish) mobility tables, my model does not assume
ordinal measurement of occupations. Of course, the assumption of ordinality
may help in interpreting results, or empirical findings may be used to explore
the metric properties of a classification. The hierarchical dimension is
strong in most occupational classifications, and the present applications
are artificial in ignoring that.

For the model to be informative, the distribution of levels across the
cells of the table must form a meaningful pattern, and one in which the
parameters are identified (Mason, Mason, Winsborough and Poole 1973; Haberman
1974:217). Further, the number of levels ($K$) should be substantially less
than the number of cells in the table. These latter properties are partly
matters of substantive and statistical interpretation and judgment, rather
than characteristics of the general model or of the data. I have found it
difficult to interpret models where the number of levels is much greater
than the number of categories recognized in the occupational classification.

It may be helpful to present the model of equation 1 in more than one
way. There is a pronounced rightward skew in multiplicative effects because
decremental effects are bounded between 0 and 1, while incremental effects
are unbounded. It is, for this reason, useful to take logs of frequencies
and parameters and to write the model in additive form; then incremental and
decremental effects may each range from zero to infinity in absolute value.

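Let $u = \log \alpha$, $u_{1(i)} = \log \beta_i$, $u_{2(j)} = \log \gamma_j$,
$u_{12(ij)} = \log \delta_{ij}$, and $u_{3(k)} = \log \delta_k$. The model is

$$\log m_{ij} = u + u_{1(i)} + u_{2(j)} + u_{12(ij)} \tag{2}$$

where $u_{12(ij)} = u_{3(k)}$ for $(i,j) \in H_k$, with $H_k$ defined as before. Here
the normalization of parameters is
$\sum_i u_{1(i)} = \sum_j u_{2(j)} = \sum_i \sum_j u_{12(ij)} = 0$.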

A slight variation of equation 1, which I present in multiplicative form,
is more suggestive of the way in which I have manipulated empirical data for
purposes of estimation and testing. With $H_k$ defined as before, let

$$E[x_{ij}] = m_{ijk} = \alpha \beta_i \gamma_j \delta_k \quad \text{for } (i,j) \in H_k \tag{3}$$

and

$$m_{ijk} = 0 \quad \text{for } (i,j) \notin H_k \tag{4}$$

subject to the normalization
$\prod_i \beta_i = \prod_j \gamma_j = \prod_k \delta_k^{n_k} = 1$, where $n_k$ is the
number of cells assigned to the $k$th level. This version of the model suggests
a 3-dimensional representation of the original 2-dimensional table in which
$IJ(K-1)$ of the interior cells contain structural zeros, and the original $IJ$
frequencies are fitted by row ($i$), column ($j$), and level ($k$) parameters
under a model of quasi-independence (Goodman 1972c:689; Bishop, Fienberg
and Holland 1975:225-226).
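
The 3-dimensional representation can be made concrete with a short Python sketch. The counts, the level assignments, and the function name are hypothetical; `None` is used here as a marker for a structural zero, which is only one of several possible conventions.

```python
def to_incomplete_array(counts, levels, n_levels):
    """Spread an I x J table over K layers; None marks a structural zero."""
    return [
        [
            [counts[i][j] if levels[i][j] == k else None
             for k in range(1, n_levels + 1)]
            for j in range(len(counts[i]))
        ]
        for i in range(len(counts))
    ]

# Hypothetical 3 x 3 counts and level assignments (K = 3).
counts = [[50, 20, 10],
          [25, 30, 15],
          [10, 20, 60]]
levels = [[1, 2, 3],
          [2, 2, 3],
          [3, 3, 3]]

array = to_incomplete_array(counts, levels, n_levels=3)

# Each of the IJ cells is observed in exactly one layer, so IJ(K-1) entries
# of the I x J x K array are structural zeros.
structural_zeros = sum(
    cell is None for row in array for layers in row for cell in layers
)
```

With $I = J = K = 3$ the array has 27 entries, of which 9 hold counts and 18 are structural zeros.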

To estimate and test models of the present form I have represented
cross-classifications as incomplete multiway arrays, and I have used Fay and
Goodman's (1973) computer program, ECTA, to estimate frequencies by iterative
rescaling and to run tests of goodness of fit (and other hypotheses). Under
the usual sampling assumptions, e.g., that the data were obtained by
independent Poisson or simple multinomial sampling, maximum likelihood
estimates are obtained in this way (Goodman 1972c:663-667; Bishop, Fienberg
and Holland 1975:206-208). The likelihood ratio test statistic ($G^2$) computed
by the program is asymptotically distributed as $\chi^2$ with degrees of freedom
equal to $IJ$, the number of cells in the array which are not structural zeros,
less the number of independent parameters which have been estimated. Often
this will be $IJ - 1 - (I-1) - (J-1) - (K-1) = (I-1)(J-1) - (K-1)$, but it may be
greater, depending on the arrangement of levels within the original 2-way
array (Bishop, Fienberg and Holland 1975: Ch. 5, esp. p. 227; Béland and
Fortier 197P). Great care should be used in computing degrees of freedom when
the design specifies separable subtables (Bishop, Fienberg and Holland 1975:
Ch. 5).
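
The usual degrees-of-freedom count can be written as a small helper. This is a sketch of the "often" case only: it does not detect designs whose arrangement of levels leaves fewer independent parameters, and the function name is an assumption of this example.

```python
def degrees_of_freedom(n_rows, n_cols, n_levels):
    """IJ cells less 1 + (I-1) + (J-1) + (K-1) independent parameters."""
    cells = n_rows * n_cols
    parameters = 1 + (n_rows - 1) + (n_cols - 1) + (n_levels - 1)
    return cells - parameters

df_independence = degrees_of_freedom(5, 5, 1)   # one level: simple independence
df_five_levels = degrees_of_freedom(5, 5, 5)    # a 5 x 5 table with five levels
```

For a 5 x 5 table this gives 16 degrees of freedom under independence and 12 when five interaction levels are fitted.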

ECTA does not estimate parameters for models of incomplete tables. I
have estimated the (additive) parameters by regressing logs of estimated
frequencies on dummy-variable representations of the rows, columns, and levels
of the design. That is, I created a dummy variable for each row (but one),
for each column (but one), and for each level (but one); then I regressed
logs of estimated frequencies on these three sets of variables. By
construction this regression completely accounted for the estimated
frequencies. I used an auxiliary program to renormalize the parameter estimates
as deviations from the grand mean and to compute and display residuals. Using
other packaged programs for the analysis of categorical data, one can estimate
the models and obtain parameter estimates and measures of fit in a single pass
by the methods of maximum likelihood or weighted least-squares (Evers and
Namboodiri 1979; Goldthorpe and Payne 1980).
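
The estimation step can be sketched with a generic iterative proportional fitting loop. This is not ECTA and makes no claim to reproduce its internals; it is a minimal pure-Python illustration that rescales fitted values to match the observed row, column, and level totals in turn. The counts and the design are hypothetical.

```python
import math

def fit_levels_model(counts, levels, sweeps=1000):
    """Fit rows + columns + levels by iterative proportional fitting."""
    n_rows, n_cols = len(counts), len(counts[0])
    cells_by_level = {}
    for i in range(n_rows):
        for j in range(n_cols):
            cells_by_level.setdefault(levels[i][j], []).append((i, j))

    fitted = [[1.0] * n_cols for _ in range(n_rows)]
    for _ in range(sweeps):
        for i in range(n_rows):                      # match row totals
            scale = sum(counts[i]) / sum(fitted[i])
            fitted[i] = [value * scale for value in fitted[i]]
        for j in range(n_cols):                      # match column totals
            scale = sum(r[j] for r in counts) / sum(r[j] for r in fitted)
            for row in fitted:
                row[j] *= scale
        for cells in cells_by_level.values():        # match level totals
            scale = (sum(counts[i][j] for i, j in cells)
                     / sum(fitted[i][j] for i, j in cells))
            for i, j in cells:
                fitted[i][j] *= scale
    return fitted

def g_squared(counts, fitted):
    """Likelihood-ratio statistic: 2 * sum of x * log(x / m) over cells."""
    return 2.0 * sum(
        x * math.log(x / m)
        for count_row, fitted_row in zip(counts, fitted)
        for x, m in zip(count_row, fitted_row)
        if x > 0
    )

# Hypothetical counts and design, for illustration only.
counts = [[50, 20, 10],
          [25, 30, 15],
          [10, 20, 60]]
levels = [[1, 2, 3],
          [2, 2, 3],
          [3, 3, 3]]

fitted = fit_levels_model(counts, levels)
g2 = g_squared(counts, fitted)
```

A fixed number of sweeps keeps the sketch short; a production routine would stop when the largest change in a fitted value falls below a tolerance. A cell that is the only member of its level, such as the first cell here, is reproduced exactly by the fit.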

In presenting goodness of fit tests and comparing alternative models,
it is convenient to use a single letter to denote each variable. For
example, in the next section I let P = father's occupation, S = son's
occupation, and H = the levels of interaction to which the several cells in
the mobility table are assigned in the model. Following the conventional
notation, in which the highest order marginal distributions fitted under a
given model are listed in a series of parentheses, I denote the model by
(P)(S)(H). Written in this form it is clear that the model is one of
statistical independence, conditional on the assignment of cells in the P by S
table to levels of H. Under the model the association between P and S is
spurious; no association (quasi-independence) between P and S occurs within
levels of H (Goodman 1972c:689). One could think of the scheme as a latent
factor or latent structure model in which the levels of H are latent classes
(Goodman 1974:1231). However, the assignment of cells and, hence, of
observations to levels of H is strictly deterministic, so the term "manifest
class" might be more fitting.

Mobility to First Jobs of American Men

Table 1 gives frequencies in a classification of son's first, full-time
civilian occupation by father's (or other family head's) occupation at son's
16th birthday among American men who were aged 20 to 64 in 1973 and were not
currently enrolled in school. The data were obtained in the Occupational
Changes in a Generation (OCG) supplement to the March 1973 Current Population
Survey (Featherman and Hauser 1975, 1978).

TABLE 1 ABOUT HERE

Table 2 is a convenient display of the final model for the data of Table 1.
Each numeric entry in the body of the table gives the level of H to which the
corresponding entry in the frequency table was assigned; one may think of them
as subscripts of dummy variables pertaining to the density of interaction in
the several regions of the table. Formally, the entries are merely labels, but
for convenience in interpretation the numeric values are inverse to the
estimated density of mobility or immobility in the cells to which they refer.
I offer no a priori rationale for the specification of interaction effects in
Table 2; it is the outcome of a search procedure that I describe later.

TABLE 2 ABOUT HERE

On this understanding the model says that, aside from conditions of
supply and demand, immobility is highest in farm occupations (level 1) and
next highest in the upper nonmanual category (level 2). If one takes the
occupation groups as ranked from high to low in the order listed, one may
say there are zones of high and almost uniform density bordering the peaks
at either end of the status distribution. There is one zone of high density
that includes upward or downward movements between the two nonmanual groups
and immobility in the lower nonmanual group. Mobility from lower to upper
nonmanual occupations (level 3) is more likely than the opposite movement,
and the latter is as likely as stability in the lower nonmanual category
(level 4). Moreover, the densities of immobility in the lower nonmanual
category and of downward mobility to it are identical to those in the second
zone of relatively high density, which occurs near the lower end of the
occupational hierarchy. The second zone includes movements from the farm
to the lower manual group and back as well as immobility in the lower manual
group. Last, there is a broad zone of relatively low density (level 5) that
includes immobility in the upper manual category, upward and downward mobility
within the manual stratum, mobility between upper manual and farm groups,
and all movements between nonmanual and either manual or farm groups.
The design says that an upper manual worker's son is equally likely to be
immobile or to move to the bottom or top of the occupational hierarchy;
obversely, it says that an upper manual worker is equally likely to have been
recruited from any location in the occupational hierarchy, including his own.

It is worth noting that four of the five interaction levels recognized
in the model occur along the main diagonal, and two of these (levels 4 and 5)
are assigned both to diagonal and to off-diagonal cells. Thus, immobility
varies among occupational strata, and it is in some cases less likely than
mobility. Also, with a single exception the design is symmetric. That is,
net of row and column effects upward mobility is more prevalent than downward
mobility within the nonmanual group. This asymmetry in the design is striking
because it suggests the power of upper white collar families to block at
least one type of status loss and because it is the only asymmetry in the
design. For example, Blau and Duncan (1967:58-67) suggest that there are
semi-permeable class boundaries separating white collar, blue collar, and
farm occupations, which permit upward mobility but inhibit downward mobility.
The only asymmetry in the present design occurs within one of the broad
classes delineated by Blau and Duncan.
Table 3 gives the row, column and interaction effects estimated in the
1973 OCG data under the model of Table 2 for intergenerational mobility to
son's first job. The estimates are expressed in additive form; that is, they
are effects on logs of frequencies under the model of equation 2. The row
and column effects clearly show an intergenerational shift out of farming
and into white collar or lower blue collar occupations. These reflect temporal
shifts in the distribution of the labor force across occupations, differential
fertility, and life-cycle differences in occupational positions. The
interaction effects show very large differences in mobility and immobility
across the several cells of the classification, and these differences closely
follow my interpretation of the display in Table 2. Differences between
interaction effects may readily be interpreted as differences in the logs of
the estimated frequencies, net of row and column effects. For example, the
estimates say that the immobility in farm occupations (at level 1) is
$3.40 = 3.044 - (-.356)$ greater (in the metric of logged frequencies) than the
estimated mobility or immobility in cells assigned to interaction level 5 in
Table 2. In multiplicative terms, immobility in farm occupations is
$e^{3.40} = 29.96$ times greater than mobility or immobility at level 5. It would
be incorrect to attach too much importance to the signs of the interaction
effects reported in Table 3, for they simply reflect our normalization rule
that interaction effects sum to zero (in the log-frequency metric) across the
cells of the table. For example, while the effects of levels 4 and 5 each
reflect relatively low densities, it is not clear that either effect indicates
"status disinheritance" in the diagonal cells to which it pertains (compare
Goodman 1969a, 1969b). In any event the effects do show a sharp density
gradient across interaction levels. The smallest difference, between levels 3
and 4, indicates a relative density 1.36 times as great at level 3 as at level
4. Immobility in farm occupations and in upper nonmanual occupations is
quite distinct from densities at other levels, but also immobility in the
farm occupations is $e^{3.044 - 1.234} = e^{1.810} = 6.11$ times as great as in the
upper nonmanual occupations.
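
The conversion from additive effects to multiplicative density ratios is a single exponentiation. The sketch below uses the three level effects quoted in the text (3.044, 1.234, and -.356); the function name is an assumption of this example.

```python
import math

def density_ratio(effect_a, effect_b):
    """Ratio of densities implied by two additive (log-metric) effects."""
    return math.exp(effect_a - effect_b)

level_1, level_2, level_5 = 3.044, 1.234, -0.356

farm_vs_level_5 = density_ratio(level_1, level_5)          # exp(3.40)
farm_vs_upper_nonmanual = density_ratio(level_1, level_2)  # exp(1.810)
```

Only differences between effects enter the calculation, which is why the signs of individual effects, fixed by the normalization, carry no separate meaning.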

Overall, the design resembles a valley in which two broad plains are
joined by a narrow strip of land between great peaks. The contours of the
peaks differ in that the one forming one side of the valley is both taller
and more nearly symmetric than that forming the other side. Figure 1 is a
pictorial representation of the model, whose dimensions are based on my
estimates of the row, column, and interaction effects (in multiplicative form).
The base of the figure is a unit square; that is, I have renormalized the
(multiplicative) row and column effects so the sum of each set is one.
Further, the total volume under the surface is one. Thus, length and breadth
can be read as probabilities, and height is proportionate to probability.
Variations in interaction effects (the vertical dimension) are far larger than
those in the horizontal dimension. For this reason the vertical scale has
been compressed by a factor of 10, so vertical and horizontal distances are
not directly comparable.

Evaluating the Model

The model of Table 2 provides less than a complete description of the
mobility data in Table 1. Under the model of statistical independence the
likelihood-ratio statistic is $G^2 = 6167.7$, which is asymptotically distributed
as $\chi^2$ with 16 degrees of freedom. With the model of Table 2 as null hypothesis
$G^2 = 66.5$ with 12 degrees of freedom, since 4 degrees of freedom are used to
create the five categories of H. By the usual inferential standards the model
does not fit, for the probability associated with the test statistic is very
small. On the other hand the model does account for 98.9 percent of the
association in the data, that is, of the value of $G^2$ under independence.
Given the extraordinarily large sample size, small departures from frequencies
predicted by the model are likely to be statistically significant.
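
The share of association accounted for is one minus the ratio of the two likelihood-ratio statistics. The short check below uses the two values of $G^2$ reported above; the function name is an assumption of this example.

```python
def share_of_association(g2_model, g2_independence):
    """Proportion of the independence G-squared removed by the model."""
    return 1.0 - g2_model / g2_independence

share = share_of_association(66.5, 6167.7)
percent = round(100.0 * share, 1)
```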

Exact tests of the difference between any two interaction parameters can
be carried out in a straightforward way. Modify the model to combine the two
groups to be contrasted in a single interaction level, and fit the revised
model. Since the revised model is a special case of (nested within) the initial
model, the difference between the likelihood-ratio $\chi^2$ statistics ($G^2$) of
the two models will be distributed as $\chi^2$ with one degree of freedom. For
example, if I combine levels 1 and 2 of the present model, the revised model
yields $G^2 = 676.3$ with 13 df, so I reject the hypothesis that immobility is the
same in the farm and upper white collar categories with
$G^2 = 676.3 - 66.5 = 609.8$ with 1 df.
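
The nested comparison can be sketched as follows. The tail probability uses the identity that a $\chi^2$ variate with one degree of freedom is a squared unit normal deviate, so the helper applies only to 1-df contrasts; the function names are assumptions of this example.

```python
import math

def nested_test(g2_restricted, df_restricted, g2_full, df_full):
    """Difference in G-squared and in degrees of freedom for nested models."""
    return g2_restricted - g2_full, df_restricted - df_full

def chi_square_tail_1df(statistic):
    """Upper-tail probability of chi-square with exactly 1 df."""
    return math.erfc(math.sqrt(statistic / 2.0))

# Levels 1 and 2 combined (restricted) versus the five-level model (full).
difference, df = nested_test(676.3, 13, 66.5, 12)
p_value = chi_square_tail_1df(difference)
```

As a reference point, a statistic of about 3.84 gives a tail probability of .05 under this helper, so a difference of 609.8 is far beyond any conventional threshold.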

By examining errors one can more fully evaluate the fit and perhaps see
how to improve the model. Table 4 displays a measure of lack of fit for each
cell of the mobility classification. It expresses errors as natural logs of
the ratios of observed frequencies to those estimated under the model:

$$\log e_{ij} = \log (x_{ij} / \hat{m}_{ij}) = \log x_{ij} - \log \hat{m}_{ij} \tag{5}$$

where $x_{ij}$ is the observed frequency and $\hat{m}_{ij}$ is the estimated frequency
in the $ij$th cell. As long as the errors are small, say, less than $\pm .20$, they
can be interpreted approximately as proportions. Thus, expressed in this way
the errors have a convenient interpretation, and positive and negative
deviations are expressed symmetrically in the metric of the (loglinear) model.
For example, the entry of .06 in the cell (3,1) says the observed mobility from
upper manual to upper nonmanual occupations is $e^{.06} = 1.06$ times the mobility
estimated under the model. The entry of -.17 in the cell (2,3) says mobility
from lower nonmanual to upper manual occupations is $e^{-.17} = .84$ times the
mobility estimated under the model, i.e., 16 percent less. As suggested by
these two examples, the approximation is better when the error is small.
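
The reading of log errors as approximate proportions can be checked numerically. The sketch below uses the two entries quoted in the text (.06 and -.17); the function names are assumptions of this example.

```python
import math

def log_error(observed, fitted):
    """Natural log of the ratio of observed to fitted frequency."""
    return math.log(observed / fitted)

def proportional_error(log_err):
    """Exact proportional deviation implied by a log error."""
    return math.exp(log_err) - 1.0

small = proportional_error(0.06)    # about +.062, close to the log error .06
large = proportional_error(-0.17)   # about -.156, further from the log error -.17
```

The gap between the log error and the exact proportion grows with the size of the error, which is the sense in which the approximation is better for small errors.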

TABLE 4 ABOUT HERE

Under the model of Table 2 cells (1,1), (2,1), and (5,5) are fitted
exactly, each by its own parameter. The fourth level--cells (1,2), (2,2),
(4,4), (4,5), and (5,4)--is also fitted closely. The largest deviation is
the 4 percent underestimate of movement from lower manual to farm occupations.
Each other deviation at level 4 is less than one percent. The lack of fit
in the model occurs primarily at level 5 of the design. There is a positive
deviation of .10 in the one diagonal cell (3,3) assigned to level 5, so
immobility in the upper manual (skilled) occupations is not quite so low as
in some other cells at the same level. At the same time the largest positive
error at level 5 is that for upward mobility from lower manual to lower
nonmanual occupations. The two largest negative errors at level 5 pertain to
the exchange between upper manual and lower nonmanual occupations (cells (3,2)
and (2,3)). Even relative to the low density (presumed by the model)
throughout level 5, there is a blockage to movement between the skilled and
lower white-collar occupations. This is more striking because there is no
similar hindrance to exchange between the skilled and upper white collar
occupations (cells (1,3) and (3,1)) or between the lower manual and lower
nonmanual occupations (cells (4,2) and (2,4)). From the entries in Table 4 one
might argue that the model and the fit could be improved by creating a sixth
interaction level to include cells (3,2) and (2,3) and, possibly, (1,5), which
indicates a very low rate of mobility from upper nonmanual origins to first
jobs in farming.

The indexes of error in Table 4 are in a metric that facilitates
interpretation and comparison, but they take no account of sampling
variability, which is inverse to expected frequency. Perhaps the simplest way
to take account of sampling variability in the errors is to form the ratio

$$z_{ij} = \frac{x_{ij} - \hat{m}_{ij}}{\sqrt{\hat{m}_{ij}}} \tag{6}$$

which is the square root of the component of the Pearson chi-square statistic
for each cell of the table. The $z_{ij}$ are (roughly) interpretable as unit
normal deviates. Since there are several more cells in the table (25) than
degrees of freedom under the model (12), the expected value of $z_{ij}^2$ is
considerably less than unity. However, I have not made a correction for that
here (see Bishop, Fienberg and Holland 1975:135-141).
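
The effect of expected frequency on a standardized error can be seen in a short sketch. It assumes the ratio is the usual Pearson component, the difference between observed and fitted frequency divided by the square root of the fitted frequency; the frequencies are hypothetical.

```python
import math

def standardized_error(observed, fitted):
    """Signed square root of one cell's Pearson chi-square component."""
    return (observed - fitted) / math.sqrt(fitted)

# The same 10 percent excess in a sparse cell and in a dense cell.
z_sparse = standardized_error(110.0, 100.0)
z_dense = standardized_error(11000.0, 10000.0)

# Squared standardized errors add up to the Pearson statistic.
pearson = z_sparse ** 2 + z_dense ** 2
```

Both cells have the same log error, yet the dense cell is ten times as far from its fitted value in standard-deviation units, which is why the two kinds of index need to be read together.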

Table 5 displays standardized errors from the model of Table 2. Again,
one is impressed with the close fit at level 4 and the heterogeneity at level
5 of the model. The interpretation of these errors must be tempered by the
results in Table 4, for the standardized errors are not in the metric of the
model. Taken in conjunction with earlier results, Table 5 also suggests a
respecification of the model in which as a first step cells (2,3), (3,2) and
possibly other negative outliers would be assigned to a separate level.
However, because the sample is so large, I have not carried the analysis of
Table 1 beyond the model of Table 2.

TABLE 5 ABOUT HERE
Mobility Ratios

One other index is particularly useful in evaluating the specification
of interaction effects. From equation 1, observed frequencies may be expressed
in terms of estimated parameters and errors:

$$x_{ij} = \hat{\alpha} \hat{\beta}_i \hat{\gamma}_j \hat{\delta}_{ij} e_{ij} \tag{7}$$

Divide both sides of equation 7 by the first three terms on the right-hand side
to obtain

$$R^*_{ij} = \frac{x_{ij}}{\hat{\alpha} \hat{\beta}_i \hat{\gamma}_j} = \hat{\delta}_{ij} e_{ij} \tag{8}$$

I call $R^*_{ij}$ the new mobility ratio, or, simply, the mobility ratio. In the
case of diagonal cells $R^*_{ij}$ is equivalent to the new immobility ratio
proposed by Goodman (1969a, 1969b, 1972c; also, see Pullum 1975:7-8), but I
suggest the ratio be computed for all cells of the table as an aid in the
evaluation of model design. If the model is specified correctly, the estimated
interactions ($\hat{\delta}_{ij}$) will be more useful in interpretation than the
$R^*_{ij}$, for the latter will confound interaction effects with sampling
errors. On the other hand, when the model is not correctly specified, the
errors ($e_{ij}$) will reflect specification error as well as sampling
variability. For this reason the $R^*_{ij}$ can be useful in revising a model
which does not fit the data.
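
The mobility ratio for one cell can be sketched as below, assuming, as the surrounding text indicates, that it is the observed count divided by the product of the overall, row, and column effects. All numerical values are hypothetical.

```python
def mobility_ratio(observed, alpha, beta_i, gamma_j):
    """Observed count divided by the overall, row, and column effects."""
    return observed / (alpha * beta_i * gamma_j)

# Hypothetical estimates for one cell.
alpha, beta_i, gamma_j, delta_ij = 100.0, 0.5, 2.0, 4.0
fitted = alpha * beta_i * gamma_j * delta_ij   # frequency under the model
observed = 440.0                               # hypothetical observed count

error = observed / fitted                      # multiplicative error
ratio = mobility_ratio(observed, alpha, beta_i, gamma_j)
```

The ratio equals the interaction effect times the error, so it reproduces the interaction effect exactly only in a cell that the model fits exactly.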

To illustrate the use of the $R^*_{ij}$, Table 6 gives these indexes for the
counts of mobility to first jobs. Obviously, the pattern of the $R^*_{ij}$
conforms to our earlier description of the design. Moreover, as may not have
been obvious from the $\hat{\delta}_{ij}$ (Table 3) and the $e_{ij}$ (Table 4) taken
separately, the fit is good enough so there is no overlap in interactions
across levels recognized in the design. For example, if immobility among
skilled workers--cell (3,3)--is high relative to mobility in other cells at
level 5 in Table 2, the immobility in that category is still substantially
less than the immobility in any other occupation group. Again, level 5 appears
to be heterogeneous, but I have not carried the analysis of Table 1 beyond the
model of Table 2.

Conceptually, $R^*_{ij}$ is related to $R_{ij}$, Rogoff's (1953) social distance
mobility ratio and Glass's (1954) index of association:

$$R_{ij} = \frac{x_{ij}}{x_{i.} x_{.j} / N} = \frac{N x_{ij}}{x_{i.} x_{.j}} \tag{9}$$

where $N$ is the sum of observed counts, and $x_{i.}$ and $x_{.j}$ are, respectively,
sums of counts in the $i$th row and in the $j$th column. Both $R^*_{ij}$ and $R_{ij}$
may be interpreted as ratios of observed counts to those estimated from a scale
factor and row and column effects under a given statistical model (see Hauser
1978:92:.-924). Indeed, $R^*_{ij} = R_{ij}$ in the special case of the model of
simple statistical independence, which specifies no interaction effects. In
terms of equation 1, $R^*_{ij}$ becomes $R_{ij}$ when we specify $\delta_{ij} = 1$ for all
$i$ and $j$.

As a measure of interaction, $R_{ij}$ has several undesirable properties.
Although $R_{ij}$ was supposed to be independent of prevalence (row and column)
effects (Rogoff 1953:32), it varies inversely as the marginal proportions in
the $i$th row and $j$th column. The maximum of $R_{ij}$ is the reciprocal of the
larger of the marginal proportions in the $i$th row and $j$th column. Also, the
set of $R_{ij}$ for a square table determines the row and column marginal
distributions. This renders $R_{ij}$ useless in comparing interaction effects
across tables with differing marginal distributions, for the multiple sets of
$R_{ij}$ cannot take on values corresponding to the hypothesis of no change.
Further, the $R_{ij}$ cannot be symmetric across the main diagonal
($R_{ij} = R_{ji}$)--showing, for example, equal propensities toward upward and
downward mobility--unless the observed counts are symmetric ($x_{ij} = x_{ji}$).
Thus, propensities toward upward and downward mobility cannot appear to be the
same unless the frequencies of upward and downward mobility are the same, and,
consequently, the two marginal distributions are the same (Blau and Duncan
1967:93-97, Tyree 1973). These undesirable properties all arise because, when
the model of simple statistical independence does not fit the data, $R_{ij}$
confounds prevalence effects (of rows and columns) with interaction effects
(Goodman 1969b). That is, the important difference between $R_{ij}$ and $R^*_{ij}$
is that the new mobility ratio is obtained from a model that fits the data, so
row and column effects are not confounded with relative densities
(interactions) in the interior of the table. For these reasons $R^*_{ij}$ does not
have the undesirable properties of $R_{ij}$. In general, (1) $R^*_{ij}$ is not
bounded; (2) in a square table the set of $R^*_{ij}$ do not determine the marginal
frequencies (nor the marginal effects); and (3) the set of $R^*_{ij}$ can be
symmetric, i.e., $R^*_{ij} = R^*_{ji}$, under any set of marginal frequencies (or
effects) (compare Tyree 1973:577-580).

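In this context it is instructive to show the relationship between the
parameters of the multiplicative model and the marginal frequencies of the
mobility table. The model fits the observed marginal frequency distributions,
that is, $\sum_i m_{ij} = x_{\cdot j}$ and $\sum_j m_{ij} = x_{i\cdot}$, so

$$x_{\cdot j} = \alpha \gamma_j \sum_i \beta_i \delta_{ij} \tag{10}$$

and

$$x_{i\cdot} = \alpha \beta_i \sum_j \gamma_j \delta_{ij}. \tag{11}$$
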
Thus, the marginal frequency in a given column (or row) is the product of
the corresponding column (or row) effect, a scale factor, and a weighted
sum of the row (or column) effects, where the weights are the interaction
effects for corresponding rows (or columns) within the given column (or row).
Similarly, from equations 7 and 8 we can write

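$$x_{\cdot j} = \sum_i x_{ij} = \alpha \gamma_j \sum_i \beta_i R^*_{ij} \tag{12}$$

and

$$x_{i\cdot} = \sum_j x_{ij} = \alpha \beta_i \sum_j \gamma_j R^*_{ij}. \tag{13}$$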

So one may alternatively think of the new mobility ratios as weights in
the expressions relating marginal frequencies to corresponding marginal
effects. There are expressions in the old mobility ratios, $R_{ij}$, which
are formally similar to equations 12 and 13; however, those expressions can
be simplified to eliminate the $R_{ij}$, while equations 12 and 13 cannot be
simplified to eliminate the $R^*_{ij}$. For example, from the definition of
$R_{ij}$,

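$$x_{\cdot j} = N^{-1} x_{\cdot j} \sum_i x_{i\cdot} R_{ij} = x_{\cdot j} \tag{14}$$

and

$$x_{i\cdot} = N^{-1} x_{i\cdot} \sum_j x_{\cdot j} R_{ij} = x_{i\cdot}, \tag{15}$$

for by definition $\sum_i x_{i\cdot} R_{ij} = \sum_j x_{\cdot j} R_{ij} = N$.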

Suppose it were possible to solve for the marginal effects by writing
linear equations in the $R^*_{ij}$, so (following Blau and Duncan 1967:93-94):

$$\sum_i \beta_i R^*_{ij} = n \quad \text{for all } j \tag{16}$$

and

$$\sum_j \gamma_j R^*_{ij} = n \quad \text{for all } i. \tag{17}$$

Under these conditions equations 12 and 13, respectively, can be rewritten as

$$x_{\cdot j} = \alpha n \gamma_j \tag{18}$$

and

$$x_{i\cdot} = \alpha n \beta_i. \tag{19}$$

That is, if the mobility ratios determine the marginal effects, then
$x_{\cdot j}$ is just a scalar multiple of $\gamma_j$, and $x_{i\cdot}$ is
just a scalar multiple of $\beta_i$, which is to imply the model is
indistinguishable from the simple independence model and $R^*_{ij}$ is
indistinguishable from $R_{ij}$. But in general the model is not that of
simple independence; $R^*_{ij}$ is not equal to $R_{ij}$, and the row and
column parameters are not scalar multiples of the marginal frequencies. This
says that the row and column effects under the model are not generally
determined by the $R^*_{ij}$. In summary, the new mobility ratios appear to
have properties that make them useful in model specification.

Mobility Ratios and Other Measures of Interaction

Do the substantively novel features of my interpretation of mobility
to first jobs merely reflect peculiarities of the 1973 OCG data? Obversely,
are those features a consequence of a different way of looking at the data?
If the old mobility ratios ($R_{ij}$) provide misleading clues about the
structure of mobility tables, are there valid measures which are easier to
obtain than the new mobility ratios ($R^*_{ij}$)?

In this section I attempt to answer these questions by subjecting the
1973 OCG data of Table 1 to a number of alternative analyses. In brief, the
answers are as follows. The 1973 OCG table of mobility to first jobs is
generally similar to other mobility tables, and any novelty in my conclusions
arises from the use of my structural model. Moreover, I have directed
my criticisms of mobility indexes primarily at the old mobility ratio,
$R_{ij}$, but several other common measures of association also fail to
elucidate the pattern of association in the mobility table for much the same
reasons that $R_{ij}$ is defective. Further, by obtaining new mobility ratios
($R^*_{ij}$) under relatively simple models of quasi-independence (which are
special cases of the model of equation 1), I can diagnose the pattern of
association without positing a model for the full table. In some cases one
can obtain sound diagnostic information without extensive calculation.

Table 7 shows standard outflow and inflow tables based on the data of
Table 1. The 1973 OCG table of mobility to first jobs appears to resemble
other mobility tables, such as the 1962 and 1973 OCG tables for mobility to
current jobs (Hauser and Featherman 1976; U.S. Department of Health,
Education, and Welfare 1969). There is evidence of status persistence and
self-recruitment; the latter is especially strong in the case of farm
occupations. There is also substantial short-distance mobility. Last, except
for the prevalence of lower manual first occupations -- which is greater
than the prevalence of lower manual fathers' occupations -- the flow of
manpower is primarily from lower to higher levels of the occupational
hierarchy, and there is a marked decline in the proportion of men with farm
occupations relative to the proportion with farm origins.

Again, Table 6 may be helpful in evaluating other measures of association
when the latter are presented in multiplicative form. One previously
unmentioned feature of the array of $R^*_{ij}$ is their high degree of
symmetry across the main diagonal (with the marked exception of the
interchange between upper and lower nonmanual occupations). This symmetry of
upward and downward flows is not apparent in the inflow and outflow tables,
nor are many of the other patterns of association in Table 6. One possible
exception is the relatively high degree of immobility in the upper nonmanual
and farm occupations. In any event I shall not pursue the comparison of
Tables 6 and 7, for neither the inflow nor outflow tables are purported to
free the pattern of association from the influence of both marginal
distributions.

Old Mobility Ratios

Table 8 shows old mobility ratios ($R_{ij}$) for the data of Table 1
under the model of simple statistical independence. Clearly, one need not
resort to hypothetical data to show the differences between interpretations
based on the old and new mobility ratios. The entries in Table 8 are similar
to those which an experienced student of mobility has encountered on many
occasions; for example, compare Pullum's (1975:3-7) description of the 5 x 5
British mobility table. From the $R_{ij}$ one would conclude (correctly) that
there is substantial immobility at both the top and bottom of the occupation
hierarchy, but not nearly as much immobility as is indicated by the
$R^*_{ij}$. The $R_{ij}$ also show status immobility in the three middle
occupation groups, but less in the lower manual than in the other two
categories. In contrast the $R^*_{ij}$ show a very low level of immobility in
the upper manual group, and they show moderate and roughly equal levels of
immobility in the lower nonmanual and lower manual groups. Both sets of
ratios show greater than expected interchange between the upper and lower
nonmanual groups with the upward flow exceeding the downward flow. The
$R_{ij}$ show asymmetric flows between the lower manual and farm groups, both
of which are below expectations, but between these same two groups the
$R^*_{ij}$ show roughly equal flows which are larger than those expected from
row, column, and scale effects. With a single exception the $R_{ij}$ decline
regularly as one moves away from the main diagonal in any row or column, but
the $R^*_{ij}$ are low and fluctuate irregularly outside the eight cells in
the upper left and lower right corners of the table. Outside those same
corners four of the $R_{ij}$ (in cells (3,2), (3,3), (3,4), and (4,2)) show
greater frequencies than expected, but none of the $R^*_{ij}$ show greater
frequencies than expected. Last, with a single exception the $R_{ij}$ are
greater in size in corresponding cells below than above the main diagonal,
and this suggests a preponderance of upward relative to downward mobility.
At the same time the $R^*_{ij}$ are roughly the same size in corresponding
cells above and below the main diagonal with the exception of the one
asymmetry in the specification of the model.

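The relationship between the $R_{ij}$ and $R^*_{ij}$ can be clarified by
expressing the former in terms of the latter. By definition

$$R_{ij} = \frac{x_{ij} N}{x_{i\cdot}\, x_{\cdot j}}. \tag{20}$$

Under the model of equation 1 and from equations 7 and 8 I write

$$x_{ij} = \alpha \beta_i \gamma_j R^*_{ij}, \tag{21}$$

so

$$N = \sum_i \sum_j x_{ij} = \alpha \sum_i \sum_j \beta_i \gamma_j R^*_{ij}. \tag{22}$$

By substitution from equations 21, 22, 12, and 13 I rewrite equation
20 as

$$R_{ij} = \frac{(\alpha \beta_i \gamma_j R^*_{ij})(\alpha \sum_i \sum_j \beta_i \gamma_j R^*_{ij})}{(\alpha \beta_i \sum_j \gamma_j R^*_{ij})(\alpha \gamma_j \sum_i \beta_i R^*_{ij})} = \frac{R^*_{ij} \sum_i \sum_j \beta_i \gamma_j R^*_{ij}}{(\sum_j \gamma_j R^*_{ij})(\sum_i \beta_i R^*_{ij})}. \tag{23}$$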

The double sum in the numerator of equation 23 is a scale factor which does
not vary with the indexes $i$ and $j$. Thus, the variable parts of the
expression say that $R_{ij}$ is related to $R^*_{ij}$ inversely as the
product of weighted averages of the column and of the row parameters, whose
respective weights are the new mobility ratios in the $i$th row and the
$j$th column. In general $R_{ij}$ will be low, relative to $R^*_{ij}$, when
the new mobility ratios in the $i$th row and the $j$th column are large, and
$R_{ij}$ will be high, relative to $R^*_{ij}$, when the new mobility ratios
in the $i$th row and the $j$th column are small. For example, the relatively
large value of $R_{33}$, the old immobility ratio for upper manual (skilled)
occupations, is explained by the very low levels of association throughout
the third row and the third column of the table (when that association is
indexed by $R^*_{ij}$). In general a given row and column need not contain
only high or only low $R^*_{ij}$, and the relationship of $R_{ij}$ and
$R^*_{ij}$ will vary among cells in the mobility classification.
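
The identity between the old and new ratios can be checked numerically. The sketch below is only an illustration under stated assumptions: the scale factor, row and column effects, and new mobility ratios are hypothetical values chosen for convenience (they are not estimates from the OCG data and are not normalized), and the variable names are my own. It builds counts from the parameters, computes the old ratios from the counts and their margins, and compares them with the old ratios obtained from the new ratios and the marginal effects.

```python
# Hypothetical parameters (illustration only): scale, row, and column
# effects, plus a table of new mobility ratios R*_ij.
alpha = 100.0
beta = [0.5, 1.0, 2.0]
gamma = [2.0, 1.0, 0.5]
r_star = [[4.0, 1.0, 0.5],
          [1.0, 0.5, 1.0],
          [0.5, 1.0, 4.0]]
size = range(3)

# Counts implied by the parameters: x_ij = alpha * beta_i * gamma_j * R*_ij.
x = [[alpha * beta[i] * gamma[j] * r_star[i][j] for j in size] for i in size]
n = sum(map(sum, x))
row = [sum(x[i][j] for j in size) for i in size]
col = [sum(x[i][j] for i in size) for j in size]

# Old ratios computed from the counts and their margins.
old_direct = [[n * x[i][j] / (row[i] * col[j]) for j in size] for i in size]

# Old ratios computed from the new ratios and the marginal effects.
double_sum = sum(beta[i] * gamma[j] * r_star[i][j] for i in size for j in size)
row_weighted = [sum(gamma[j] * r_star[i][j] for j in size) for i in size]
col_weighted = [sum(beta[i] * r_star[i][j] for i in size) for j in size]
old_from_new = [[r_star[i][j] * double_sum / (row_weighted[i] * col_weighted[j])
                 for j in size] for i in size]

largest_gap = max(abs(old_direct[i][j] - old_from_new[i][j])
                  for i in size for j in size)
```

Because the identity is algebraic, the two computations should agree to rounding error for any positive parameter values, whether or not they are normalized.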

Standardized Deviates 

Are other expressions of residuals under the model of simple independence
more instructive with regard to the structure of association in the mobility
table? The upper panel of Table 9 shows standardized deviates under the model
of statistical independence (recall equation 6 and Table 5). In looking at
the standardized deviates we have left the (multiplicative) metric of the
model. That is, the standardized deviates are test statistics, and they
reflect variations in the standard errors of deviations of observed from
expected frequencies across cells of the table, as well as the pattern of the
residuals themselves. Thus, we would expect the standardized deviates to be
more helpful to us in locating extreme outliers, as in cells (1,1) and (5,5),
than in evaluating the pattern of association in the table. In any event
the pattern exhibited by the standardized deviates is rather close to that
of the old mobility ratios. Similar results would be obtained had we chosen
to express the residuals as components of the likelihood-ratio statistic
($G^2$) or as Freeman-Tukey deviates (Bishop, Fienberg and Holland
1975:136-137; but
O mtz Wb). 



TABLE 9 ABOUT HERE

Haberman (1973) has suggested a transformation of the standardized
deviates, which is in our notation

$$d_{ij} = \frac{z_{ij}}{\sqrt{(1 - x_{i\cdot}/N)(1 - x_{\cdot j}/N)}}. \tag{24}$$

The adjusted residual, $d_{ij}$, has better asymptotic distributional
properties than $z_{ij}$, and Brown (1974) has shown that $d_{ij}$ is more
effective than $z_{ij}$ in identifying a small number of outliers. The
adjusted standardized deviates are shown in the lower panel of Table 9. As
one might expect the adjusted deviates are not more instructive with regard
to the overall pattern of association than are the unadjusted deviates or the
old mobility ratios. At the same time they do clearly identify three large
positive deviates (in cells (1,1), (4,4), and (5,5)) whose elimination might
elucidate the pattern of association throughout the table; I shall elaborate
this suggestion later.
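
Both kinds of residual are easy to compute under simple independence. The sketch below is a hedged illustration rather than the computation behind Table 9: it assumes the standardized deviate takes the usual form, the observed minus the expected count divided by the square root of the expected count, and it uses the conventional adjusted residual for the independence model. The function name and the 2 x 2 counts are hypothetical.

```python
import math


def independence_residuals(counts):
    """Return (z, d): standardized deviates and adjusted residuals
    under simple independence for a table given as a list of rows."""
    row = [sum(r) for r in counts]
    col = [sum(c) for c in zip(*counts)]
    n = sum(row)
    z, d = [], []
    for i, r in enumerate(counts):
        z_row, d_row = [], []
        for j, x in enumerate(r):
            m = row[i] * col[j] / n                 # expected count
            z_ij = (x - m) / math.sqrt(m)           # standardized deviate
            scale = math.sqrt((1 - row[i] / n) * (1 - col[j] / n))
            z_row.append(z_ij)
            d_row.append(z_ij / scale)              # adjusted residual
        z.append(z_row)
        d.append(d_row)
    return z, d


# Hypothetical 2 x 2 counts (illustration only).
z, d = independence_residuals([[30, 10], [20, 40]])
```

In a 2 x 2 table the four adjusted residuals are equal in absolute value, and their common square is the Pearson chi-square statistic, which gives a convenient check on the arithmetic.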

Parameters of the Saturated Loglinear Model

Like the specification of equation 2, the conventional parametric
representation of the loglinear model also describes frequencies in terms of
parameters for row effects, column effects, and interaction effects.
Moreover, one can "saturate" the model by including all main effects and
interactions, thus fitting observed counts perfectly. Critics have suggested
to me that interaction parameters under the saturated loglinear model would
yield, by inspection alone, substantially the same interpretation as that
obtained using my model. However, the usual normalization of parameters of
the loglinear model gives priority to row and column effects relative to
interactions, that is, relative to association in the interior cells of the
table. Even though the saturated loglinear model fits a table completely,
this conventional normalization of parameters has much the same effect on the
pattern of interaction parameters as the assumption of statistical
independence has on the pattern of old mobility ratios. Consequently, under
the saturated loglinear model the multiplicative parameters for the
interactions are no more informative than the residuals from the simple
independence model: the marginal effects are too large in rows or columns
where the interactions are strong, and the marginal effects are too small
where the interactions are weak. Obversely, the estimated interactions are
deflated or inflated relative to a model in which row, column, and
interaction effects are given equal priority.

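An algebraic presentation of the conventional loglinear model may
clarify this argument. Let

$$l_{ij} = \log m_{ij}, \tag{25}$$

where $m_{ij}$ is the expected count in the $ij$th cell, and
$\sum_i \sum_j m_{ij} = N$. The saturated loglinear model says that

$$l_{ij} = u + u_{1(i)} + u_{2(j)} + u_{12(ij)} \tag{26}$$

subject to the constraints

$$\sum_i u_{1(i)} = \sum_j u_{2(j)} = \sum_i u_{12(ij)} = \sum_j u_{12(ij)} = 0. \tag{27}$$

Under these constraints the u-terms are obtained by a row and column
decomposition of the logs of expected frequencies paralleling that in a
two-way analysis of variance with one observation per cell (Bishop, Fienberg
and Holland 1975:24):

$$u = \sum_i \sum_j l_{ij} / IJ, \tag{28}$$

$$u_{1(i)} = \sum_j l_{ij} / J - u, \tag{29}$$

$$u_{2(j)} = \sum_i l_{ij} / I - u, \tag{30}$$

and

$$u_{12(ij)} = l_{ij} - (u_{1(i)} + u_{2(j)}) - u. \tag{31}$$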



Note that equation 26 and equation 2 are identical (excepting notation), and
the constraints on row and column parameters are the same in the two models.
The important difference between the models of equations 2 and 26 lies in the
constraints on the interaction parameters ($u^*_{12(ij)}$ and $u_{12(ij)}$)
and in the implications of those constraints for the specification of
equalities among subsets of interaction parameters. The specification that
the interaction parameters sum to zero within every row and within every
column of the table (see equation 27) is equivalent to the model of simple
independence in its implications for interpreting the pattern of association
in the table. By relaxing the normalization of the $u_{12(ij)}$ in equation
27 one can obtain new insights into the pattern of association in the table.
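
The conventional decomposition of the saturated model can be sketched in a few lines. The code below is an illustration under stated assumptions, not the computation behind Table 10: it treats the observed counts as the expected counts (which is what saturation implies), requires all counts to be positive, and uses a hypothetical 3 x 3 table and a hypothetical function name.

```python
import math


def saturated_u_terms(counts):
    """Decompose log counts into a constant, row effects, column effects,
    and interactions that sum to zero in every row and column."""
    logs = [[math.log(x) for x in row] for row in counts]
    n_rows, n_cols = len(logs), len(logs[0])
    u = sum(map(sum, logs)) / (n_rows * n_cols)
    u1 = [sum(logs[i]) / n_cols - u for i in range(n_rows)]
    u2 = [sum(logs[i][j] for i in range(n_rows)) / n_rows - u
          for j in range(n_cols)]
    u12 = [[logs[i][j] - (u1[i] + u2[j]) - u for j in range(n_cols)]
           for i in range(n_rows)]
    return u, u1, u2, u12


# Hypothetical positive counts (illustration only, not the OCG table).
table = [[50, 20, 10],
         [20, 40, 20],
         [10, 20, 60]]
u, u1, u2, u12 = saturated_u_terms(table)

# Multiplicative interaction parameters, as one would tabulate them.
interactions = [[math.exp(v) for v in row] for row in u12]

# The fitted counts reproduce the observed counts exactly (saturation).
fitted = [[math.exp(u + u1[i] + u2[j] + u12[i][j]) for j in range(3)]
          for i in range(3)]
```

Because the interactions are forced to sum to zero within each row and column, strong interactions in a row or column are partly absorbed into that row or column effect, which is the point at issue in the text.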

This observation can be elaborated by writing the main effects in the
saturated model (of equations 26 and 27) as functions of the main effects
and interactions in equation 2. In the loglinear metric I substitute
equation 2 for $l_{ij}$ in each of equations 28 to 31, recalling the
constraints on sums of the $u^*$-terms:

$$u = u^*, \tag{32}$$

$$u_{1(i)} = u^*_{1(i)} + \sum_j u^*_{12(ij)} / J, \tag{33}$$

$$u_{2(j)} = u^*_{2(j)} + \sum_i u^*_{12(ij)} / I, \tag{34}$$

$$u_{12(ij)} = u^*_{12(ij)} - \sum_j u^*_{12(ij)} / J - \sum_i u^*_{12(ij)} / I. \tag{35}$$

The constants in the two models are the same; the main effects of the $i$th
row differ by the average of the interaction effects in that row; the main
effects of the $j$th column differ by the average of the interaction effects
in that column; the interaction effects of the $ij$th cell differ by the
averages of the interaction effects in the $i$th row and the $j$th
column. Note that equations 33 and 34 are (additive) analogs of equations
10 and 11, respectively. Just as the marginal sum of frequencies in any row
(or column) varies with the interaction effects in that row (or column), so
the main effect of any row (or column) in the saturated model varies with
the interaction effects in that row (or column). Similarly, equation 35 is
an (additive) analog of equation 23. Just as $R_{ij}$ varies (inversely)
relative to $R^*_{ij}$ as a function of the interactions in the $i$th row
and $j$th column, so $u_{12(ij)}$ varies (inversely) relative to the
$u^*_{12(ij)}$ as a function of the interactions in the $i$th row and $j$th
column. If we regard equation 2 as the structural model, then the
conventional row by column decomposition of the saturated model yields main
effects and interactions which are mixtures of
parameters of the structural model (compare Goldberger 1973 or Duncan 1975:151).

For example, Table 10 presents the multiplicative parameters for a
saturated model which fits the data of Table 1. These tell essentially the
same story as the old mobility ratios in Table 8, but they are a slight
improvement over the old mobility ratios. Note that the parameters in cells
(1,1) and (5,5) are larger than the old mobility ratios in Table 8, and the
variation among parameters of the sparser cells is less than that among
mobility ratios in the corresponding cells. The improvement occurs because
the row and column effects under statistical independence are based on sums
of frequencies, while they are based on sums of logs of frequencies in the
case of the saturated loglinear model. The operation of taking logs reduces
the effect of positive outliers on the row and column sums, and so reduces
(but does not eliminate) the influence of the small number of large
interactions, i.e., of the positive skew of frequencies, on the row and
column effects. However, the array of parameters in Table 10 is still
substantially misleading in respect to the pattern of association in Table 1.
For example, it shows roughly equal immobility in cells (2,2), (3,3), and
(4,4) and it does not suggest the homogeneity of densities in cells assigned
to levels 4 or 5 in Table 2.

Adjustment to Uniform Marginals

I shall consider one other method of inspecting the pattern of association
in the full mobility table. Mosteller (1968) drew attention to Levine's
(1967) use of iterative proportional rescaling to adjust British and Danish
5 x 5 mobility tables to uniform marginal distributions. This adjustment
facilitated Levine's interpretation of the mobility tables and exposed
similarities in the pattern of association in the two tables. (For further
evidence and discussion of the similarity of the British and Danish tables
see Goodman 1969a, 1969b and Bishop, Fienberg and Holland 1975:100.) The
iterative proportional rescaling procedure is generally attributed to Deming,
who suggested it be used to adjust sample cross-classifications to known
marginal distributions (Scheuren and Oh 1975); the same procedure is used to
obtain maximum-likelihood estimates of frequencies in loglinear models of
contingency tables when no closed-form estimates exist (Bishop, Fienberg and
Holland 1975:83-87). The method has been applied frequently in recent years
(Duncan 1966; Tyree and Treas 1974; Garnier and Hazelrigg 1974, 1976;
Hazelrigg 1974a, 1974b; Hauser et al. 1975a; Pullum 1975; Hauser, Featherman
and Hogan 1977).

The adjustment procedure is straightforward. Each row entry is multiplied
by the ratio of the desired row sum to the actual row sum. Then each
column entry is multiplied by the ratio of the desired column sum to the
actual column sum. By alternating row and column adjustments, convergence
of the adjusted cell counts to both desired marginal totals is usually
obtained within a few iterations. Multiplicative adjustment preserves the
initial pattern of association in a table because odds-ratios are invariant
to scalar transformations applied uniformly across rows and columns. For
example, the upper panel of Table 11 gives hypothetical frequencies in a
2 x 2 mobility table. The odds-ratio in this table is

$$\frac{M_{11}/M_{12}}{M_{21}/M_{22}} = \frac{M_{11}/M_{21}}{M_{12}/M_{22}} = \frac{M_{11} M_{22}}{M_{21} M_{12}}. \tag{36}$$

Suppose the frequencies in interior rows 1 and 2 of the table are multiplied
by arbitrary non-zero constants, say, $a$ and $b$, respectively. Likewise,
the frequencies in the interior columns of the table are multiplied by
non-zero constants $c$ and $d$. The adjusted frequencies are shown in the
lower panel of Table 11. Under this transformation the marginal proportions
will not generally be preserved, but the odds-ratio will be unaffected, for

$$\frac{(ac\, M_{11})(bd\, M_{22})}{(bc\, M_{21})(ad\, M_{12})} = \frac{M_{11} M_{22}}{M_{21} M_{12}}. \tag{37}$$

As early as 1912 Yule recognized the desirability of constructing measures
of association with this invariance property (Goodman and Kruskal 1954:747).
In fact both the old ($R_{ij}$) and the new ($R^*_{ij}$) mobility ratios have
this invariance property, i.e., that they preserve the odds-ratios in the
observed frequencies; this is obvious from inspection of equations 20 and 21.
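
The alternating row and column adjustment, and the invariance of odds-ratios under it, can be sketched as follows. This is a minimal illustration, not the routine used for Table 12: the function names, the fixed number of sweeps, and the 3 x 3 counts are all hypothetical choices, and the table is assumed to contain only positive counts.

```python
def rescale_to_margins(counts, row_targets, col_targets, sweeps=200):
    """Alternately rescale rows and columns toward the target sums."""
    table = [[float(x) for x in row] for row in counts]
    for _ in range(sweeps):
        for i, row in enumerate(table):                 # row adjustment
            factor = row_targets[i] / sum(row)
            table[i] = [x * factor for x in row]
        for j in range(len(col_targets)):               # column adjustment
            factor = col_targets[j] / sum(row[j] for row in table)
            for row in table:
                row[j] *= factor
    return table


def odds_ratio(t, i, j, k, l):
    """Odds-ratio for the 2 x 2 subtable formed by rows i, k and columns j, l."""
    return (t[i][j] * t[k][l]) / (t[i][l] * t[k][j])


# Hypothetical 3 x 3 counts (illustration only, not the OCG table).
observed = [[50, 20, 10],
            [20, 40, 20],
            [10, 20, 60]]
# Marginal sums of 3 in a 3 x 3 table make independence correspond to unity.
adjusted = rescale_to_margins(observed, [3, 3, 3], [3, 3, 3])

before = odds_ratio(observed, 0, 0, 1, 1)
after = odds_ratio(adjusted, 0, 0, 1, 1)
```

The adjusted table has uniform margins but the same odds-ratios as the original, so any difference between its entries and unity reflects association as the rescaling expresses it.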

In recommending adjustment to uniform marginals Mosteller (1968:8)
implied that the adjusted frequencies elucidated the pattern of association
in a table:

    ...we can interpret the resulting numbers as transitional or
    conditional probabilities expressed in per cents -- either son's
    distribution given the father's category, or father's given
    the son's... In the sense of having a common nucleus of
    association... It would be fair to say that the two occupational
    tables are nearly equivalent.

Similarly, in respect to the same example, Bishop, Fienberg and Holland
(1975:100) write

    By comparing diagonal values we see that, except for category
    1, the tendency for fathers and sons to fall into the same
    category is stronger in Denmark than Britain. By looking
    across rows we can see, for fathers in each category, in which
    country the sons are more mobile.

Again, Fienberg (1971:308) suggests that standardization to uniform marginals
permits one "to look at the association or interaction, unconfounded by the
two sets of marginal allocations."

While it is strictly correct that the original marginal frequencies of
a table cannot be deduced from the set of frequencies adjusted to uniform
marginals, neither do the adjusted frequencies display the pattern of
association in the sense intended here. For example, Table 12 gives the
adjusted frequencies of the data in Table 1. I chose uniform marginal sums of
5, so the condition of simple independence would yield an entry of unity in
each cell (compare Tyree and Treas 1974). The pattern of marginally adjusted
frequencies in Table 12 is virtually identical to that of the old mobility
ratios in Table 8, and it is markedly different from the pattern of the new
mobility ratios in Table 6. One improvement is that the adjusted frequencies
are more nearly symmetric about the main diagonal than are the old mobility
ratios. If the marginal adjustment eliminates variation in the marginal
distributions, why doesn't it uncover the underlying pattern of association
in the table? The problem lies in the distinction between marginal
distributions ($x_{i\cdot}$, $x_{\cdot j}$) and marginal effects ($\beta_i$,
$\gamma_j$); recall equations 10 and 11. Equalization of the marginal
distributions does not equalize the marginal effects; the former differ from
the latter because they are confounded with the underlying pattern of
interaction in the table. For example, consider again the third row or the
third column of the mobility classification (upper manual occupations).
Because the interactions are weak in that row and column, the marginal
proportions are relatively low. Consequently, the adjustment procedure
induces too large a relative increase in the marginal frequencies in that
row and column, leading to an excessively large adjusted entry in the (3,3)
cell.

Similarly, in analyzing the British and Danish mobility tables Levine
(1967) adjusted the frequencies to uniform marginals, took logs of the
adjusted frequencies, and fitted smooth curves to the logs of adjusted
frequencies. Levine's model is flawed because the initial multiplicative
adjustment did not reveal the pattern of association in the British and
Danish tables. However, if adjustment of the full table to uniform marginals
does not yield the interaction structure, neither is it valueless. The
procedure can be used as a rough guide to similarity or dissimilarity in
the odds-ratios of two or more tables, even though it does not provide
a satisfactory picture of the pattern of association in any one
classification.
In summary, I have evaluated several measures of association which
are based on the model of simple independence and other measures which are
based on the saturated model. There are real differences among these
measures of association. For example, Haberman's adjusted deviates are
useful in locating a small number of outlying frequencies. When the
interaction structure is symmetric but the marginal distributions are not,
the marginally adjusted frequencies or the multiplicative parameters of the
saturated model will display the underlying symmetry, but the old mobility
ratios will not. At the same time, each of the measures I have reviewed
suggests essentially the same interpretation of the pattern of association
in the mobility table. This interpretation is in each case fundamentally
different from that suggested by the new mobility ratios ($R^*_{ij}$). This
difference occurs primarily because the other measures of association
confound main effects (of rows and columns) with interaction effects.

Model Specification Under Quasi-Independence

One can obtain superior insights into the structure of association in
a table by temporarily ignoring those cells of the classification which
contribute most to the confounding of interaction effects with row and
column effects. In Goodman's (1965, 1969a) terms one "blanks out" those cells
and fits models of quasi-independence to the remaining cells. Equivalently,
in my multiplicative models for the full table, I fit one parameter to
each cell which is to be ignored, and I assign the remaining cells to a
single level of the design matrix. Table 13 shows the equivalent design
matrices for three models of quasi-independence in the 5 x 5 table.

TABLE 13 ABOUT HERE
Model Q1 is the quasi-perfect mobility model. It ignores (or fits
exactly) the frequencies on the main diagonal, which represent occupational
inheritance relative to the five-category occupational classification. Under
the null hypothesis there is no association in the remainder of the table,
which is coded at level 1 in the design matrix both for the full and the
partial tables. Frequencies are estimated in those cells by iteratively
fitting a matrix containing ones at level 1 and zeros elsewhere, i.e., the
Q1 design matrix in the left-hand column of Table 13, to the observed
marginal frequencies in the fitted cells. That is, the fitting procedure
preserves both the observed marginal frequencies and the hypothesized lack
of association in the fitted portion of the table. While there are 25
degrees of freedom in the 5 x 5 table, we lose nine degrees of freedom in
fitting row, column, and scale effects, and we lose another five degrees of
freedom in fitting the six-level Q1 model. Thus, under the null hypothesis
that there is no association off the main diagonal, there are 25 - 9 - 5 = 11
degrees of freedom for error.
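
The fitting procedure for the quasi-perfect mobility model can be sketched as iterative rescaling of a 0/1 design matrix. The code below is an illustration under stated assumptions and is not the ECTA program: the function names, the fixed number of sweeps, and the 4 x 4 counts are hypothetical, and the smaller table is used only to keep the example short. The degrees of freedom are counted in the same way as in the text, as cells less row, column, and scale effects, less blanked-out cells.

```python
import math


def fit_quasi_independence(counts, fitted, sweeps=500):
    """Fit independence within the cells where fitted[i][j] is 1.

    Starts from the 0/1 design matrix and rescales it to the observed row
    and column sums taken over the fitted cells only.
    """
    n_i, n_j = len(counts), len(counts[0])
    row_t = [sum(counts[i][j] * fitted[i][j] for j in range(n_j))
             for i in range(n_i)]
    col_t = [sum(counts[i][j] * fitted[i][j] for i in range(n_i))
             for j in range(n_j)]
    m = [[float(v) for v in row] for row in fitted]
    for _ in range(sweeps):
        for i in range(n_i):                            # row adjustment
            f = row_t[i] / sum(m[i])
            m[i] = [v * f for v in m[i]]
        for j in range(n_j):                            # column adjustment
            f = col_t[j] / sum(m[i][j] for i in range(n_i))
            for i in range(n_i):
                m[i][j] *= f
    return m


def g_squared(counts, expected, fitted):
    """Likelihood-ratio statistic over the fitted cells."""
    return 2 * sum(x * math.log(x / e)
                   for xr, er, fr in zip(counts, expected, fitted)
                   for x, e, f in zip(xr, er, fr) if f and x > 0)


# Hypothetical 4 x 4 counts (illustration only, not the OCG table).
observed = [[80, 20, 10, 5],
            [25, 60, 20, 10],
            [10, 25, 50, 20],
            [5, 10, 20, 90]]
design = [[0 if i == j else 1 for j in range(4)] for i in range(4)]
expected = fit_quasi_independence(observed, design)
g2 = g_squared(observed, expected, design)
blanked = sum(row.count(0) for row in design)
df = 4 * 4 - (2 * 4 - 1) - blanked   # 16 cells - 7 marginal effects - 4 blanked
```

Within the fitted zone every odds-ratio of the expected frequencies equals one, which is the sense in which the fit preserves the hypothesized lack of association.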

Table 14 summarizes the fit of maximum likelihood estimates of the
independence model and other multiplicative models of the 5 x 5 table of
mobility to first jobs (Table 1). Clearly Model Q1 accounts for much of
the association in the table. While $G^2 = 683.06$ is still very large
relative to its degrees of freedom, it is only about one-ninth the value of
$G^2$ under simple independence. Further, while the simple independence
model misallocates 20.1 percent of the joint distribution of fathers and sons
(as indicated by the index of dissimilarity, $\Delta$, in the fourth column
of Table 14), Model Q1 misallocates only 5.5 percent of the observations.
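
The index of dissimilarity can be computed from observed and expected counts as follows. This sketch assumes the conventional definition, half the sum of absolute differences between observed and expected counts divided by the total count, which I take to be the definition behind Table 14; the function names and the 2 x 2 counts are hypothetical.

```python
def index_of_dissimilarity(observed, expected):
    """Share of cases misallocated by the model: sum |x - m| / (2 N)."""
    total = sum(map(sum, observed))
    gap = sum(abs(x - m)
              for xr, mr in zip(observed, expected)
              for x, m in zip(xr, mr))
    return gap / (2 * total)


def independence_fit(counts):
    """Expected counts under simple independence."""
    row = [sum(r) for r in counts]
    col = [sum(c) for c in zip(*counts)]
    n = sum(row)
    return [[row[i] * col[j] / n for j in range(len(col))]
            for i in range(len(row))]


# Hypothetical 2 x 2 counts (illustration only).
observed = [[30, 20], [20, 30]]
expected = independence_fit(observed)
delta = index_of_dissimilarity(observed, expected)
```

In this small example each expected count is 25, so ten percent of the cases would have to be moved to make the observed table match the fitted one.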
The first panel of Table 15 presents ratios of observed frequencies
to those expected at level 1 of Model Q1, i.e., within the zone of
quasi-independence specified in the design matrix. Although the diagonal
cells are not assigned to level 1 of Model Q1, we have also shown ratios of
observed to expected frequencies in the diagonal cells. The diagonal entries
are the indexes of immobility proposed by Goodman (1969a, 1969b); they are
ratios of the observed frequencies to those frequencies which would have been
estimated in the main diagonal if the quasi-independence hypothesis held in
the main diagonal. Alternatively, we may say the expected frequencies are
those predicted by the row, column, and scale effects at level 1 of the
design.

Those expected frequencies are not produced directly by the computer
program (ECTA) used to estimate Models Q1, Q2, and Q3, but with a simple
model in a small table it is convenient to estimate the expected frequencies
in those cells from the expected frequencies in the level where
quasi-independence is presumed to hold. Under the null hypothesis all of the
odds-ratios within the zone of quasi-independence are equal to unity (Goodman
1968, 1969a). Thus, if we know only three expected frequencies in a 2 x 2
subtable of the full table, we can solve for the fourth expected frequency
by setting the odds-ratio in the subtable equal to unity. For example, Table
16 shows the expected frequencies in each cell of the mobility table under
Model Q1. To obtain the expected frequency in cell (1,1), we could use the
entries in cells (1,2), (3,1), and (3,2) to write

$$m_{11} = \frac{754.9\,(344.1)}{697.7} = 372.3. \tag{38}$$

Other combinations of cells could be used to obtain the same estimate within
the limits of rounding error. In models (like Q3) where a relatively large
number of cells has been ignored, it may take a certain amount of ingenuity
to fill in the expected frequencies for all of the blanked-out cells.
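
The arithmetic of filling in a blanked-out cell can be written as a one-line function. The sketch below is only an illustration; the function name and argument names are hypothetical, and the three numbers are the expected frequencies quoted in the text above for the cells that share a 2 x 2 subtable with cell (1,1).

```python
def fill_from_unit_odds_ratio(same_row, same_col, opposite):
    """Solve for the fourth cell of a 2 x 2 subtable whose odds-ratio is one.

    same_row and same_col are the cells sharing a row or a column with the
    unknown cell; opposite is the cell diagonal to it in the subtable.
    """
    return same_row * same_col / opposite


# The two numerator values and the denominator value quoted in the text.
m_11 = fill_from_unit_odds_ratio(754.9, 344.1, 697.7)
```

Using a different pair of rows or columns within the zone of quasi-independence should give the same value up to rounding of the tabled frequencies.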

The ratios of observed to expected frequencies in Table 15 are not
new mobility ratios ($R^*_{ij}$), but they differ from the $R^*_{ij}$ only by
a scalar multiple. That is, I have expressed the $R^*_{ij}$ as deviations
from a scale factor (or grand mean) for the full table, but the ratios in
Table 15 are expressed as deviations from expected frequencies at level 1 of
the design. The relationship between those ratios and the $R^*_{ij}$ is like
that between the normalization of parameters in dummy-variable regression and
in multiple classification analysis. In the former case the reference point
is the effect of one category of a qualitative regressor, and in the latter
case the reference point of effect measures is the grand mean of the
dependent variable. With the understanding that a change in normalization has
occurred, I shall refer to the entries in Table 15 as mobility ratios.

Under Model Q1 the mobility ratios show a pattern of association which
is somewhere between that displayed by the $R_{ij}$ (Table 8) and the
$R^*_{ij}$ (Table 6). Relative to the $R^*_{ij}$, the ratios in Table 15 are
larger in cells (1,1) and (5,5), and they are also relatively larger in cells
(4,5) and (5,4) and, to a lesser degree, in cells (1,2) and (2,1). The ratios
for Model Q1 do not appear to fall as rapidly as one moves away from the main
diagonal as do the $R^*_{ij}$. At the same time there is still a relatively
high ratio in the central diagonal cell, (3,3).

The fit of Model Q1 is not very close, and there are relatively large
mobility ratios in four of the cells which were not fitted exactly under Model
Q1: (1,2), (2,1), (4,5), and (5,4). For these reasons I write the design
matrix of Model Q2 to ignore those four cells as well as the diagonal cells.
Thus, Model Q2 has only seven degrees of freedom for error under the null
hypothesis. White (1963) and Pullum (1975) advocate models of this form;
also, see Fienberg (1976) for an evaluation of this specification as applied
by Pullum. The fit is much improved under Model Q2, so one would expect the
residuals to be more informative. Under Model Q2, $G^2 = 50.05$, which is only
0.8 percent of its value under simple independence. The model misclassifies
only 1.4 percent of the joint frequency distribution of fathers' and sons'
occupations.
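
The two summary measures used here can be computed directly from observed and
fitted counts. The sketch below assumes that the percentage "misclassified"
is the index of dissimilarity between the observed and fitted distributions;
the function names and the 2 x 2 counts are hypothetical and are not taken
from Table 1.

```python
import math


def g_squared(observed, expected):
    """Likelihood-ratio statistic: 2 * sum(x * ln(x / m)) over cells with x > 0."""
    return 2.0 * sum(x * math.log(x / m)
                     for x, m in zip(observed, expected) if x > 0)


def dissimilarity(observed, expected):
    """Percentage of cases that would have to change cells to match the fit."""
    n = sum(observed)
    return 100.0 * sum(abs(x - m) for x, m in zip(observed, expected)) / (2.0 * n)


# Hypothetical 2 x 2 counts (flattened row by row) and their expected
# frequencies under simple independence.
obs = [30.0, 10.0, 20.0, 40.0]
fit = [20.0, 20.0, 30.0, 30.0]
g2_value = g_squared(obs, fit)
delta = dissimilarity(obs, fit)
print(round(g2_value, 2), round(delta, 1))
```

Cells that are ignored (fitted exactly) under a quasi-independence model
contribute nothing to either measure, since observed and fitted counts agree.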

The mobility ratios for Model Q2 show a pattern which is far more like
that of the $R^*_{ij}$. One problem is the relatively high ratio in cell (3,3),
but there is otherwise little variation in the ratios outside the intersections
of rows 1 and 2 with columns 1 and 2 and of rows 4 and 5 with columns 4 and
5. Moreover, the change in specification has again increased the mobility
ratios in cells (1,1) and (5,5).

I make one other effort to fit the data more closely without precluding
the estimation of all of the row and column effects, that is, without making
it impossible to obtain mobility ratios for all of the cells in the table.
In Model Q3 I ignore all of the cells on the main diagonal and on the
adjacent minor diagonals. Here the fit is rather close, with $G^2 = 15.7$ on
three degrees of freedom and $\Delta = 0.6$. Of course, in obtaining this fit I
ignore (or fit exactly) the cells containing about three-fourths of the
observations, but my purpose is not to fit the data both closely and
parsimoniously. Rather, I am trying to explore association in the table by
fitting it closely in a way that permits me to obtain diagnostic measures of
association.

Clearly, relative to the standard set by the pattern of $R^*_{ij}$ in Table 6,
Model Q3 is very helpful in uncovering the pattern of association in the table.
The mobility ratios in Panel Q3 of Table 15 show all of the major features of
the display in Table 2 and of the $R^*_{ij}$ in Table 6. Taken in conjunction
with my review of other residual measures, the lesson in this illustrative
analysis should be clear. Diagnostic or exploratory analysis of a
classification will often be improved by ignoring large parts of the
classification. It may be better to ignore too much than too little of the
classification, provided one is left with enough information at the end to
construct diagnostic measures for the full table. In the present case, did I
not wish to show the evolution of the array of mobility ratios under
successively improved specifications, I would specify Models Q1, Q2, and Q3 in
advance and look at the mobility ratios only under Model Q3, because it fits
well, as a guide to specification of a more parsimonious model. In fact, after
grouping cells with similar mobility ratios under Model Q3, I wrote the
specification in Table 2 by inspection.

In this context it is instructive to recall my earlier comparison of
constraints on parameters of the saturated model with constraints on
parameters in my multiplicative model. In the former case parameters for
row-by-column interaction are constrained within each row and column of the
table; that is, the sum of interaction parameters is zero (in the loglinear
model) and the product of interaction parameters is unity (in the
multiplicative model) within each row and within each column of the table. In
the latter case a similar constraint holds over all cells of the
classification, but not within each row and column. The quasi-independence
model, like our multiplicative model, provides superior diagnostic insights
because it, too, does not constrain interaction parameters within rows or
columns. This is easy to see if we consider the array of expected frequencies
under the quasi-independence model. Within the zone of quasi-independence
there are no row-by-column interactions. In the remainder of the table the
observed frequencies are fitted exactly with multiplicative parameters defined
by the ratio of observed frequencies to those expected from row and column
effects within the quasi-independent zone of the table. Given any set of
expected frequencies (and row and column effects) in the quasi-independent
zone of the table, the remaining frequencies (and their corresponding
interaction parameters) can vary freely, so clearly there are no constraints
on the interaction parameters within rows or columns. The absence of row and
column constraints on interaction parameters, shared by my model and by
quasi-independence models, leads in both instances to improved diagnostic and
interpretative insights into the mobility table.

Model Specification by Median Fitting 

Throughout this exposition I have relied on maximum-likelihood estimates
obtained by iterative proportional rescaling, but computationally simpler
methods may suffice, especially in the analysis of small tables. To
illustrate, I repeat the estimation of Model Q3 by median fitting. For a full
treatment of such methods, see Tukey (1977: Ch. 11). Panel A of Table 17 gives
the natural logs of observed frequencies in those cells of Table 1 which are
quasi-independent under Model Q3. In the last column I show row means obtained
as the first step in the analysis; deviations of the entries in Panel A from
the row means are carried forward to Panel B, and in the last row of Panel B
I take column means. Panel C shows deviations from the column means in Panel
B, and here I begin a similar analysis by medians. In Panels C through I, I
alternate the extraction of row and column medians until, at Panel I, the row
medians are each rather close to zero. In rows or columns containing an even
number of cells I take the midpoint of the two central observations as the
median. The solution requires minimal calculation because most of the medians
can be ascertained by inspection; beyond the initial computation of means,
subtraction is the main arithmetic operation.
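
The sequence of steps described above (row means, column means, then
alternating row and column medians on the cells that were not ignored) can be
sketched with NumPy. This is an assumption-laden illustration rather than a
reproduction of Table 17: the function name `median_fit`, the fixed number of
sweeps, and the 4 x 4 table of log counts are all hypothetical, and ignored
cells are marked with NaN.

```python
import numpy as np


def median_fit(log_counts, n_sweeps=10):
    """Additive row-plus-column fit to a table whose ignored cells are NaN.

    One pass of row means and one pass of column means are followed by
    alternating row and column medians. Returns (row_effects, col_effects,
    residuals); the row effects absorb the overall level because rows are
    swept first.
    """
    resid = np.array(log_counts, dtype=float)
    row_eff = np.zeros(resid.shape[0])
    col_eff = np.zeros(resid.shape[1])

    step = np.nanmean(resid, axis=1)          # row means
    row_eff += step
    resid -= step[:, None]
    step = np.nanmean(resid, axis=0)          # column means
    col_eff += step
    resid -= step[None, :]

    for _ in range(n_sweeps):
        step = np.nanmedian(resid, axis=1)    # row medians
        row_eff += step
        resid -= step[:, None]
        step = np.nanmedian(resid, axis=0)    # column medians
        col_eff += step
        resid -= step[None, :]
    return row_eff, col_eff, resid


# Hypothetical log counts; NaN marks the ignored (diagonal) cells.
nan = np.nan
table = np.array([[nan, 4.0, 3.0, 2.0],
                  [4.5, nan, 3.5, 2.5],
                  [3.0, 3.2, nan, 3.1],
                  [2.0, 2.6, 3.4, nan]])
rows, cols, resid = median_fit(table)
fitted_diagonal = rows + cols   # estimated log frequencies in the ignored cells
```

`np.nanmedian` takes the midpoint of the two central values when a row or
column holds an even number of observed cells, which matches the rule stated
in the text. Each row and column must retain at least one observed cell.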

TABLE 17 ABOUT HERE

In Table 18 I cumulate the several row effects and the several column
effects obtained in Table 17. The resultant row and column sums are the
estimates of main effects under Model Q3. The row effects are larger
(absolutely) than the column effects because they include the grand mean of the
observations. In Panel A of Table 19 I use the row and column effects to
estimate logs of frequencies in the cells of Table 1 which were ignored in
fitting Model Q3. For example, the estimate in cell (1,1) is 5.219 + .337 =
5.556. In Panel B of Table 19 I assemble the residuals from the interior
cells in Panel I of Table 17 together with the deviations of observed from
expected log frequencies in the cells ignored in fitting the model. Panel
C shows the antilogs of the entries in Panel B, which can be interpreted as
the mobility ratios in Table 15 under Model Q3. Obviously, this crude manual
fit does not exactly reproduce the mobility ratios obtained from the
maximum-likelihood estimates, but the pattern is close enough for diagnostic
purposes.

In using exploratory methods, like those illustrated here, one always
runs the risk of overfitting data; that is, one may model features of the
data which occur only as a result of sampling fluctuation. The surest
protection against overfitting is independent validation, the test of a
model against independent observations. It may also help to smooth the
data statistically, e.g., by averaging across population subgroups or by
combining some categories of the row and column classifications. I have
illustrated these methods elsewhere (Hauser 1979), and I shall not elaborate
them here. Rather, it may be more instructive to consider other data to
which the present methods may fruitfully be applied.

Sibling Resemblance in Educational Attainment

In the Wisconsin Longitudinal Study of Social and Psychological Factors
in Achievement (Sewell and Hauser 1980) more than 9,000 male and female high
school graduates were interviewed in 1975, 18 years after high school
graduation, and in roughly 2,000 cases a randomly selected sibling was
interviewed in 1977. Panel A of Table 20 gives the counts in a classification
of self-reports of educational attainment by respondents (in the 1975 study)
and their siblings (in 1977). The association between these two variables
should be indicative of the strength and manner in which social, psychological,
and genetic factors in the family of orientation lead to similarity in the
educational attainments of siblings. There is an inherent symmetry in the way
we look at this table. Neither sibling's education is causally prior to that
of the other; rather, we think of both as determined by exogenous factors
in their families and in the larger society. In looking at these data, our
main interest lies in the joint occurrence of each pair of educational
outcomes, and we want to abstract the density of each joint occurrence from the
widely varying relative frequencies of levels of schooling.

By inspection of Panel A of Table 20, it appears that the symmetry in
our interpretation of the table is reflected in the counts. Notice that
corresponding row and column sums are very similar, as are corresponding
entries, $x_{ij}$ and $x_{ji}$, across the main diagonal. This symmetry appears
also in corresponding row and column percentage distributions, shown in Panel B
and Panel C of Table 20. Formally, the hypothesis of symmetry says
$m_{ij} = m_{ji}$ for all $i \neq j$; the maximum-likelihood estimates of
off-diagonal frequencies are just
$\hat{m}_{ij} = \hat{m}_{ji} = (x_{ij} + x_{ji})/2$. Diagonal frequencies are
not constrained by this model, so $\hat{m}_{ii} = x_{ii}$ for all i. Under this
hypothesis the test statistic $G^2$ is distributed as $\chi^2$ with
$K(K-1)/2$ df (Bishop, Fienberg and Holland 1975).

Panel D of Table 20 gives the maximum-likelihood estimates; the model fits
rather well, for $G^2 = 11.59$ with 15 df. Panel E gives the row percentage
distributions of the symmetric counts; these are of course the same as
corresponding column percentage distributions. Most persons complete only 12
years of schooling, but there is a secondary mode at 16 years, the completion
of a 4-year college. Educational attainments of siblings are positively
correlated, but even among pairs where one sibling attended graduate school,
the chances were about equal (30 percent) that the other sibling only graduated
from high school or only graduated from college.
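
The closed-form estimates under symmetry are easy to compute. The sketch below
is a minimal illustration using NumPy; the function name `fit_symmetry` and
the 3 x 3 counts are hypothetical and are not the Wisconsin data.

```python
import numpy as np


def fit_symmetry(counts):
    """Return (expected, g2, df) for the symmetry model of a square table."""
    x = np.asarray(counts, dtype=float)
    expected = (x + x.T) / 2.0        # off-diagonal pairs averaged; diagonal exact
    pos = x > 0                       # zero counts contribute nothing to G^2
    g2 = 2.0 * np.sum(x[pos] * np.log(x[pos] / expected[pos]))
    k = x.shape[0]
    return expected, g2, k * (k - 1) // 2


# Hypothetical 3 x 3 counts.
counts = [[50, 12, 5],
          [8, 40, 9],
          [7, 11, 30]]
expected, g2, df = fit_symmetry(counts)
```

For a table with six categories the same function returns 15 degrees of
freedom, in agreement with the count given in the text.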

While the finding of symmetry greatly simplifies the interpretation of
the data, one might hope to find yet a more parsimonious pattern of interaction
underlying the observations. Panels F, G, and H of Table 20 show the values
of three of the indexes of interaction that were discussed above (computed
from counts in Panel A). As shown in Panel F, the model of simple independence
fits poorly. The ratios of observed to estimated frequencies suggest little
tendency for siblings to share in completion of grades 12 or 13, but there
is a stronger tendency toward joint completion of grades 14 and grades 16 or
17+. Further, the ratios are curiously high in cells (13, 15) and (15, 13).
Panel G gives parameters of the saturated model. The parameter in cell (12,
12) is now the largest in the table; this shows how the corresponding entry
in Panel F was depressed by the large relative frequencies of high school
graduation among respondents and siblings. There remain tendencies toward
joint completion of grades 14 and 16 or 17+, but these are not so large as
in Panel F. Again, the relatively large entries in cells (13, 15) and
(15, 13) are surprising; those two levels of schooling are neither adjacent,
nor are they typical points of termination in the educational process.
Panel H shows counts adjusted to uniform row and column sums of six, and
these show substantially the same pattern as the parameters in Panel G.
All three of these sets of indices show a general tendency for the values
to fall as one moves away from the main diagonal (again, excepting cells
(13, 15) and (15, 13)).
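
One common way to adjust counts to uniform row and column sums is to rescale
rows and columns in turn until both sets of margins agree with the target.
Whether Panel H was produced by exactly this routine is an assumption; the
function name `standardize_margins` and the 3 x 3 counts below are
hypothetical.

```python
import numpy as np


def standardize_margins(counts, target=1.0, n_iter=200):
    """Rescale rows, then columns, repeatedly so each sums to `target`.

    Row and column rescaling leaves the cross-product ratios of the table
    unchanged, so the adjusted counts display interaction free of the margins.
    """
    m = np.asarray(counts, dtype=float).copy()
    for _ in range(n_iter):
        m *= target / m.sum(axis=1, keepdims=True)   # rows
        m *= target / m.sum(axis=0, keepdims=True)   # columns
    return m


# Hypothetical 3 x 3 counts, adjusted to row and column sums of three.
counts = [[50, 12, 5], [8, 40, 9], [7, 11, 30]]
adjusted = standardize_margins(counts, target=3.0)
```

With a 6 x 6 table and `target=6.0`, a cell value of one would then correspond
to the absence of any tendency toward concentration in that cell.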

Panel I specifies the interaction parameters in a very simple
multiplicative model which is based upon some, but not all, of our observations
about Panels F, G, and H. The model says that there are equal tendencies of
siblings to share in the completion of grades 12, 14, and 16 or 17+. These
are the major termination points of schooling in the United States. Otherwise,
the model says, there are no tendencies toward association or dissociation
between the educational attainments of siblings. This model fits the
data of Panel A rather well, yielding $G^2 = 22.59$ with 24 df; the model uses
only 1 df for interaction. As shown beneath Panel J of Table 20, the estimates
say that sibling pairs are 2.84 times more likely to complete 12, 14, or 16
and 17+ years of school than to complete any other combination of years of
schooling. Panel J displays the products ($R^*_{ij}$) of parameter estimates
and residuals under this model. These generally confirm our inference about the
fit of the model, but there may be a slightly lower tendency than estimated
for siblings to cluster at grade 14 and a slightly higher tendency for them
to cluster at grades 17+. In any event, the overriding feature of the data,
which is not accessible by inspection, is the tendency for siblings to share
in the completion of major segments of the educational process. Otherwise,
there is little or no tendency toward similarity in their educational
attainments.

The data of Table 20 are fitted well, also, by a model that incorporates
two further simplifications: (1) that the main effects are identical in
corresponding rows and columns, and (2) that the joint density of siblings'
attainments at major educational transitions is precisely three times greater
than densities elsewhere in the table.

The first of these two hypotheses, marginal homogeneity, is suggested by
the earlier finding that the counts are symmetric. Symmetry incorporates two
distinct hypotheses. First, it says that interaction effects are equal in
corresponding cells above and below the main diagonal, so for example,
$\delta_{ij} = \delta_{ji}$ for all i and j in the model of equation 1 and
$u_{12(ij)} = u_{12(ji)}$ for all i and j in the model of equation 26. This
hypothesis, called quasi-symmetry, is already implicit in the specification of
parameters in Panel I of Table 20. Second, symmetry also implies that main
effects are equal in corresponding rows and columns, that is, row and column
marginal distributions are homogeneous. As noted earlier, the combination of
quasi-symmetry and marginal homogeneity implies equality in population counts
in corresponding cells above and below the main diagonal, that is,
$m_{ij} = m_{ji}$ for all i and j.

Models of symmetry, quasi-symmetry, and marginal homogeneity can easily
be fitted and evaluated using a 3-way array composed of the original table
and its transpose. Let 1 = the original row variable, 2 = the original column
variable, and 3 = the transposition variable (whose values specify the original
table and its transpose). Symmetry is imposed by fitting the marginal
configurations (12)(3), leaving $K(K-1)/2$ degrees of freedom for error;
quasi-symmetry is imposed by fitting the marginal configurations (12)(13)(23),
leaving $(K-1)(K-2)/2$ degrees of freedom for error. The goodness-of-fit test
statistics yielded by this set-up should be divided by two because of the
double entry of each count; equivalently, each count in the array may be
divided by two. The difference between the test statistics under symmetry and
quasi-symmetry yields a test of the hypothesis of marginal homogeneity with
$K-1$ degrees of freedom, one for each distinct constraint on row and column
effects.
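
The doubling of the test statistic in the stacked set-up can be checked
numerically. The sketch below builds the 3-way array from a table and its
transpose, fits the configurations (12)(3) in closed form, and compares the
result with the direct symmetry fit. The counts and the helper name `g2` are
hypothetical.

```python
import numpy as np


def g2(observed, expected):
    """Likelihood-ratio statistic over cells with positive observed counts."""
    pos = observed > 0
    return 2.0 * np.sum(observed[pos] * np.log(observed[pos] / expected[pos]))


x = np.array([[50, 12, 5], [8, 40, 9], [7, 11, 30]], dtype=float)  # hypothetical
array3 = np.stack([x, x.T], axis=2)   # variable 3: original table vs. transpose

# Model (12)(3) has closed-form estimates: (12 margin) * (3 margin) / total.
margin12 = array3.sum(axis=2)              # x_ij + x_ji
margin3 = array3.sum(axis=(0, 1))          # total count in each layer
fitted = margin12[:, :, None] * margin3[None, None, :] / array3.sum()

g2_stacked = g2(array3, fitted)
g2_symmetry = g2(x, (x + x.T) / 2.0)
print(np.isclose(g2_stacked / 2.0, g2_symmetry))
```

Quasi-symmetry, (12)(13)(23), has no closed form and would need iterative
proportional rescaling over the same array.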

For example, in the data of Panel A of Table 20 we have already noted
that symmetry yields a likelihood-ratio test statistic $G^2 = 11.59$ with 15
df. Under quasi-symmetry, we obtain $G^2 = 8.95$ with 10 df, so the test of
marginal homogeneity yields $G^2 = 2.64$ with 5 df. Thus, the data are
consistent with both of the hypotheses subsumed under symmetry.

The second simplification is suggested by the model of Panels I and J of
Table 20; the joint density of observations is roughly three times greater in
cells (12,12), (14,14), (16,16), (16,17), (17,16), and (17,17) than elsewhere
in the classification. I specify a model in which the density is precisely
three times greater in the selected cells by fitting only the row and column
marginal configurations (as under simple statistical independence) to a table
of starting values in which 3s have been entered in the high density cells and
1s in all other cells. In order to estimate the simplified model, incorporating
symmetry and three-fold density, I modified the starting values in the set-up
used to test symmetry and quasi-symmetry and fitted just the univariate marginal
distributions for rows, columns, and transposition. This simplified model fits
the data very well; $G^2 = 25.99$ with 30 df. That is, in the simplified model
I estimate only six parameters, one for the scale factor (total count) and
five for the row/column effects. Finally, on the basis of this analysis the
test statistic may be partitioned into four additive components: (1) departures
from quasi-symmetry ($G^2 = 8.95$ with 10 df), (2) other departures from the
model of differential density at major schooling transitions ($G^2 = 13.64$
with 14 df), (3) departures from marginal homogeneity ($G^2 = 2.64$ with 5 df),
and (4) departures from three-fold density ($G^2 = .76$ with 1 df). None of
these components of error approaches statistical significance.
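
As a check on the partition, the four components quoted above can be summed;
the arithmetic below uses only figures given in the text.

```latex
\begin{aligned}
G^2 &: \quad 8.95 + 13.64 + 2.64 + 0.76 = 25.99, \\
\text{df} &: \quad 10 + 14 + 5 + 1 = 30.
\end{aligned}
```

The first two components alone sum to 22.59 with 24 df, which is the fit
reported for the model of Panel I.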

Occupational Similarity of Friends

In Table 21, Panel A shows counts in a classification of the occupations
of a sample of Detroit men and their friends. The data were obtained
from a sample of 1,000 men who were asked to name and to describe the
occupations of their three best friends. Since the sample clusters friends
within respondents, there are in effect fewer than the 2,073 nominal
observations suggested by the sum of counts, but I have made no correction for
this lack of independence.

As shown in Panel B, respondents choose friends whose occupations
resemble their own. Between 41 and 46 percent of the nominations from each
occupational group fall within the same group. Panel C shows the ratios of
observed frequencies to those estimated under simple independence. In the
source, Jackson {}97t:S3) refers to the diagonal entries of Panel C as
self-selection ratios; he comments that self-selection is highest among
upper white collar men and least among blue collar men, while friendship
declines with social distance throughout the table. As shown in Panel C,
the model of simple independence fits these data very poorly ($G^2 = 778.17$
with 9 df), and for this reason the ratios in Panel C are quite misleading.

One might expect to find a certain symmetry in the table, for it is
difficult to see how affinity between occupational categories as such should
depend on the direction of choice. At the same time the counts in Panel A
of Table 21 are clearly not symmetric, for men at the top of the occupational
hierarchy must choose friends of the same or lower status, while those at
the bottom must choose friends of the same or higher status. The model of
symmetry, which fit in the previous case, must be rejected here ($G^2 = 42.46$
with 6 df). However, a weaker model, quasi-symmetry (Bishop, Fienberg and
Holland 1975: Ch. 8), does fit these data ($G^2 = 2.25$ with 3 df). The model
of quasi-symmetry, like that of symmetry, says interaction effects are the
same in corresponding cells above and below the diagonal, but it does not
add the constraint of homogeneity in corresponding marginal effects. Thus,
under quasi-symmetry $m_{ij}$ may differ from $m_{ji}$ because the i-th row and
the i-th column effects may differ and because the j-th row and the j-th column
effects may differ. Panel D of Table 21 gives the estimated frequencies under
quasi-symmetry, and these were used in the next stage of model selection.

Since theory and data suggest that observations cluster along the
diagonal, I fitted a quasi-independence model that ignored those four cells.
The starting values for that model are shown in Panel E and, following the
methods used in Table 15, ratios of observed to expected frequencies are
given in Panel F. The model does not fit well, but it is a great improvement
over simple independence. The new self-selection ratios suggest that
within-group choice is high at either extreme of the status hierarchy.
Moreover, when out-group choice occurs, white collar men choose other (higher
or lower status) white collar men, and to about the same degree blue collar men
choose other (higher or lower status) blue collar men. White collar men are
much less likely to choose blue collar friends, and vice versa. When the
blue-collar/white-collar line is crossed there is little status differentiation
in choice within the out-group.

Panel G of Table 21 specifies a model with the features which appeared
in the ratios in Panel F, and Panel H displays the relative densities
($R^*_{ij}$) under this model in the original table (Panel A). The model of
Panel G fits very well ($G^2 = 6.46$ with 5 df); moreover, a very simple
pattern appears in the parameter estimates under this model (shown beneath
Panel H), namely, that the estimates are in the ratios 1:2:3:4:5. With this as
a clue, in the final model, I specified that relative densities would have the
hypothesized values by fitting the row and column marginals of the observed
table to the starting values shown in Panel I of Table 21. Note that the
entries in Panels G and I are identical, but the latter are hypothesized
parameter values, and the former are arbitrary subscripts of variables.
Since the parameters are specified as constants, the final model (nominally)
has 9 df, and the fit is very good ($G^2 = 7.63$); little is lost by imposing
the constraint of linearity on the parameters. Plainly, my final model
of the Detroit data differs substantially from Jackson's analysis by
inspection of departures from independence. There are different tendencies
toward self-selection in each occupational group, and these are greatest at the
extremes of the status hierarchy. When out-group friendship occurs, it tends
to be among white collar men (for white-collar choosers) and among blue-collar
men (for blue collar choosers). White collar men rarely choose blue-collar
friends, and vice versa; when such choice occurs, status distinctions within
the out-group are ignored. This final model is just as parsimonious as that
in the source; it estimates no parameters for interaction. Yet, unlike
Jackson's analysis, it fits the data well.
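
Fitting observed margins to a table of hypothesized starting values can be
sketched as follows. This is a minimal illustration of iterative proportional
rescaling with fixed densities, not the ECTA program used for the analyses;
the function name `fit_margins_to_start`, the counts, and the starting values
are all hypothetical.

```python
import numpy as np


def fit_margins_to_start(observed, start, n_iter=500, tol=1e-10):
    """Rescale `start` until its row and column totals match `observed`.

    The fitted table keeps the cross-product ratios of `start`, so the
    densities entered there act as fixed rather than estimated parameters.
    """
    x = np.asarray(observed, dtype=float)
    m = np.asarray(start, dtype=float).copy()
    for _ in range(n_iter):
        m *= (x.sum(axis=1) / m.sum(axis=1))[:, None]   # match row totals
        m *= (x.sum(axis=0) / m.sum(axis=0))[None, :]   # match column totals
        if np.allclose(m.sum(axis=1), x.sum(axis=1), rtol=0, atol=tol):
            break
    return m


# Hypothetical 3 x 3 counts; density 3 on the diagonal and 1 elsewhere.
observed = [[50, 12, 5], [8, 40, 9], [7, 11, 30]]
start = [[3, 1, 1], [1, 3, 1], [1, 1, 3]]
fitted = fit_margins_to_start(observed, start)
```

Because no interaction parameter is estimated, the degrees of freedom are the
same as under simple independence, as the text notes for the final model.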

Comparisons Between Classifications 

In a series of analyses my colleagues and I were able to locate no
sociologically interpretable historical changes in the relative occupational
mobility chances of American men when we used saturated models of the mobility
classification (Hauser et al. 1975b; Featherman and Hauser 1978: Ch. 3).
Reviewing these null findings we thought the lack of significant evidence
of change might be due to the low statistical power of our models, rather than
the absence of change in mobility chances. A parsimonious model increases
statistical power in comparisons; the need for parsimony, as much as the
issue of interpretation, motivates our efforts to apply models of the present
form. While other models may be even more parsimonious, my models do use
far fewer parameters to fit a given table than does the saturated model.
In discussing comparative methods, I start with general hypotheses about
the restrictions imposed by the model and later take up more specific
hypotheses about the values of parameters or sets of parameters.

If we borrow a model that fits one classification in order to fit a
second classification, we readily obtain an explicit test of the partition
of cells in the initial model. Recall, for example, that the model of Table
2 fits the aggregate table of mobility from father's occupation to son's first
occupation in the 1973 OCG survey with $G^2 = 66.5$ on 12 df. To test the
assignment of cells to levels in this model, I use the same model in an
analysis of mobility from father's occupation to son's first occupation by age
using data from the 1962 OCG survey. That is, where P = father's occupational
stratum, W = occupational stratum of son's first job, A = age in 5-year
groups, and H = the model of Table 2, I fit the model (PA)(WA)(HA) to the
1962 data and obtain a test statistic of $G^2$ = 121.? with 108 df, which is
not statistically significant. That is, conditional on variation in
occupational origins and destinations between cohorts, the same set of equality
restrictions fits interactions between father's occupation and son's first
occupation in the 1962 OCG survey as in the 1973 OCG survey. The lack of
significant departures from this model does not indicate that mobility
chances are numerically identical (or even remotely similar) in these two
surveys. I have tested only the hypothesis that the restrictions on
interactions across cells of the classification are met in both sets of
data, not the hypothesis that interaction parameters take on the same value.

In a straightforward way we may also test the equality of interaction
parameters among two or more cross-classifications. For example, I compare
mobility from father's occupation to son's first occupation across nine
five-year age cohorts covered in the 1973 OCG survey. Under the assumption that
mortality is negligible during the prime working ages (20 to 64), while
first jobs pertain to a fixed point in the life-cycle, these comparisons
reflect conditions of labor market entry during the period from the late 1920s
to the early 1970s. As shown in the first column at the top of Table 22, I
begin by fitting the model (PA)(WA)(H), in which occupational origins and
destinations vary across cohorts, but relative mobility chances do not vary.
I have 9 mobility subtables, each with 16 degrees of freedom after conditioning
on the observed marginal distributions; since the five-level model of
Table 4 uses just four degrees of freedom, there are 140 degrees of freedom
for error. Under this specification I obtain the significant test statistic,
$G^2 = 235.3$. At the same time the simple model of Table 2 is quite powerful;
it explains $G^2 = 5667.7$ with 4 df (Featherman and Hauser 1978:200). I also
fit the same table with the model (PA)(WA)(HA), which fits origin and
destination effects as in the initial model, but permits the parameters of the
model to vary across cohorts. Under this model I obtain $G^2 = 175.6$ with 108
df; relative to the initial model, I fit four more parameters for each of eight
subtables. Again, the test statistic is statistically significant, showing
that there are non-chance departures from the specification within one or
more cohorts. More important, since the model (PA)(WA)(HA) is obtained from
(PA)(WA)(H) by relaxing restrictions on interactions in the latter, I may
test these restrictions by taking the difference between the two test
statistics. I obtain $G^2 = 59.7$ with 32 df (top line, third column of Table
22), whose probability is very small under the null hypothesis. Thus, there are
statistically significant intercohort variations in parameters of the model
of Table 2. In passing I note that the last test statistic is larger relative
to its degrees of freedom than the test statistic for intercohort change
under the model (PA)(WA)(PW) in the same classification; $G^2 = 166.6$ with 128
df, for which $\alpha = .012$.
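
The contrast between the two nested models is a matter of subtraction followed
by a chi-square tail probability. The sketch below uses only the test
statistics and degrees of freedom quoted in the text; the helper
`chi2_sf_even` is a hypothetical name and relies on the closed-form upper tail
that holds when the degrees of freedom are even, which is the case for both
contrasts here.

```python
import math


def chi2_sf_even(x, df):
    """Upper-tail chi-square probability; closed form valid only for even df."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("closed form requires a positive even df")
    half = x / 2.0
    term, total = 1.0, 1.0
    for i in range(1, df // 2):
        term *= half / i          # (x/2)^i / i!
        total += term
    return math.exp(-half) * total


# Test statistics and degrees of freedom quoted in the text.
g2_constant, df_constant = 235.3, 140    # (PA)(WA)(H)
g2_variable, df_variable = 175.6, 108    # (PA)(WA)(HA)

g2_change = g2_constant - g2_variable    # test of intercohort change
df_change = df_constant - df_variable
p_change = chi2_sf_even(g2_change, df_change)
p_saturated = chi2_sf_even(166.6, 128)   # contrast under (PA)(WA)(PW)
print(round(g2_change, 1), df_change)
```

For odd degrees of freedom a general chi-square routine from a statistics
library would be needed instead of this closed form.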

TABLE 22 ABOUT HERE

To summarize, by conditioning on the marginal distributions and fitting
the models of constant interaction and of variable interaction, subject to
a given design matrix, we can test the fit of each of those two models;
further, a contrast between those two models yields a global test of change
in the parameters of the design matrix. Using a similar procedure we can
test hypotheses about change in each level parameter of a cross-classification
and at the same time we can test hypotheses about the lack of fit within each
level of the design matrix. Again, we construct appropriate test statistics
by contrasting hierarchical models and exploiting the additive properties of
the likelihood ratio test statistic ($G^2$); see Bishop, Fienberg and Holland
(1975:126-127).

Figure 2 defines and shows the relationships among four types of models
of the cross-classification from which we obtain the desired test statistics.
Consider a four-way incomplete classification in which two factors with I
and J categories, respectively, represent the mobility table (or other
cross-classification), a third factor with K categories specifies the
interaction effects, and a fourth factor with L categories represents the
replicates of the I x J cross-classification which we wish to compare. Further,
denote the number of (independently variable) cells within the k-th level of
the design matrix by $M_k$.

As shown in the first row of Figure 2, I condition on the observed
marginal distributions of the I x J classification and fit models of constant
association and of variable association to the entries in the four-way
classification. Each fit yields a likelihood ratio test statistic $G^2$: one
under the model of constant association, e.g., (PA)(WA)(H) in the example of
Table 22, and one under the model of variable association, e.g., (PA)(WA)(HA)
in the example of Table 22. Then, as shown in the second row of Figure 2, I
blank out or ignore (by entering structural zeros) each of the counts at the
k-th level of the design matrix, and I fit the models of constant association
and of variable association to the counts in the truncated four-way
classification. I repeat this procedure for each of the K levels of the design
matrix. The truncated classification likewise yields one likelihood ratio test
statistic under the model of constant association and one under the model of
variable association. When I blank out or ignore entries at the k-th level of
the I x J classifications I am implicitly fitting constants to each of those
entries, so the degrees of freedom under models of the truncated classification
are reduced by the number of (independently variable) counts at the k-th level
of each I x J classification.

By comparing the terms $G^2_a$, $G^2_b$, $G^2_c$, and $G^2_d$, I can test a
variety of hypotheses about change in interactions (level parameters) among
the several I x J classifications and about lack of fit within levels of the
model.

These contrasts and the associated degrees of freedom are shown in Figure 3.
I have already applied (in Table 22) a contrast like that on Line 1, which
gives an overall test of constancy in the level parameters across the several
I x J classifications. Recall that the model of variable association relaxes
restrictions on parameters of the model of constant association, so the
comparison, $G^2_a - G^2_b$, yields a test of constancy in all K level
parameters across the L mobility classifications. Since a K-level design has
K - 1 df, while the model of constant association specifies only one set of
level parameters, it takes (K - 1)(L - 1) more degrees of freedom to specify
the model of variable association than to specify the model of constant
association.

FIGURE 3 ABOUT HERE

The contrasts on Lines 2 and 3 of Figure 3 are of lesser interest, but
I include them for the sake of completeness. The comparison on Line 2,
$G^2_c - G^2_d$, is analogous to that on the first line, but the test ignores
parameters at the kth level, which have been excluded both from the models of
constant and of variable association; thus, the test has (L - 1) fewer degrees
of freedom than the global test on Line 1. In the model of constant
association the L sets of counts at the kth level of the I x J classification
are fitted by a single parameter, but (implicitly) a parameter is fitted to
each of these counts when entries at the kth level are blocked. Thus, the
comparison on Line 3 of Figure 3, $G^2_a - G^2_c$, yields a test of the
combined effects of change and of lack of fit at the kth level of the design
matrix, and this test has $LM_k - 1$ degrees of freedom.

The comparison on Line 4 of Figure 3, $G^2_b - G^2_d$, yields a test of fit
at the kth level of the design matrix across the L mobility classifications.
$G^2_b$ is obtained under the model of variable association in the full
classification, so it reflects lack of fit at each of the K levels across the
L mobility classifications. I obtain $G^2_d$ by relaxing the $M_k - 1$
equality restrictions on interactions at the kth level within each of the L
mobility classifications, so the test statistic has a total of $L(M_k - 1)$
degrees of freedom. Since the models used to generate this test condition
both on the marginal distributions and interaction parameters of the L
distinct I x J classifications, the overall comparison, $G^2_b - G^2_d$, may
be regarded as the sum of L independent test statistics, each with $M_k - 1$
degrees of freedom, pertaining to lack of fit within the kth level of one of
the L classifications. Plainly, an analogous test may be obtained at the kth
level of any I x J classification merely by contrasting the fit of the full
classification with that of the classification from which counts at the kth
level have been removed.

In the fifth line of Figure 3 I show a comparison of all four of the
test statistics from Figure 2 that tests the L - 1 equality restrictions
imposed by the model of constant association at the kth level of the design.
One may think of this comparison as a contrast between the models of Lines 1
and 2 of Figure 3, or alternatively, one may think of it as a contrast between
the models of Lines 3 and 4 of Figure 3. That is, the contrast in Line 2
differs from that in Line 1 only in permitting the kth level parameter to vary
across the L classifications, and the contrast in Line 4 differs from that in
Line 3 only in permitting the kth level parameter to vary across the L
classifications.
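
The degrees of freedom attached to the five contrasts can be tabulated mechanically. The sketch below is only an illustration of the bookkeeping described in the text: the function name and argument names are hypothetical, and it assumes that ignoring the kth level leaves every remaining row, column, and level effect identified. That assumption holds for level 4 of the design in Table 2 but not for level 5, which needs the separate treatment the text gives it.

```python
def contrast_df(K, L, M_k):
    """Degrees of freedom for the five contrasts described for Figure 3.

    K   -- number of levels in the design matrix
    L   -- number of I x J tables being compared
    M_k -- number of independently variable cells at the ignored level k
    """
    # Line 1: constant vs. variable association in the full classification.
    overall_change = (K - 1) * (L - 1)
    # Line 2: the same contrast in the truncated classification.
    change_except_k = overall_change - (L - 1)
    # Line 3: constant association, full vs. truncated classification.
    change_and_fit_at_k = L * M_k - 1
    # Line 4: variable association, full vs. truncated classification.
    fit_at_k = L * (M_k - 1)
    # Line 5: Line 1 minus Line 2, which must equal Line 3 minus Line 4.
    change_at_k = overall_change - change_except_k
    assert change_at_k == change_and_fit_at_k - fit_at_k
    return {
        1: overall_change,
        2: change_except_k,
        3: change_and_fit_at_k,
        4: fit_at_k,
        5: change_at_k,
    }


# Level 4 of the five-level design (five cells), compared across nine cohorts.
level4 = contrast_df(K=5, L=9, M_k=5)
print(level4)  # {1: 32, 2: 24, 3: 44, 4: 36, 5: 8}
```

With K = 5, L = 9, and five cells at level 4, the sketch returns the 32, 24, 44, and 36 degrees of freedom quoted in the worked example, and 8 degrees of freedom for change at a single level.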

I illustrate some of the contrasts in Figure 3 by continuing my analysis
of intercohort change in mobility to first occupations among American men.
In Lines 1 to 5 of the first panel of Table 22 I report test statistics and
degrees of freedom under the model of constant association as each level of
the design (Table 2) is ignored in turn. Since levels 1, 2, and 3 each
include only 1 count per cohort, I use 8 df more than in the constant
association model for the full table when I ignore the entries at any one of
these three levels. There are five entries in each of the nine cohort mobility
tables at level 4. When these 45 entries are ignored, the remaining row,
column, and level effects are each still identified, so I lose a total of
44 degrees of freedom relative to the model of constant association in the
full table.

If I used the same rule to calculate the degrees of freedom lost in
ignoring level 5 as in ignoring level 4, I would end up with a negative
number in Line 5, rather than the 24 degrees of freedom reported there. That
is, there are 17 counts in each cohort mobility table at level 5 of the
design, so one might (incorrectly) say there are 140 - [(17)(9) - 1] = -12
degrees of freedom under the model of constant association when level 5 is
ignored. The error in this calculation becomes obvious when we inspect the
display in Table 2. First, since level 5 covers all of Row 3 and all of
Column 3 of the mobility classification, there are only four independent
entries in each of Row 3 and Column 3. Second, by the same token there is
no effect of Row 3 nor of Column 3 when the fifth level of the mobility
classification is ignored. Thus, the erroneous calculation has subtracted
four too many degrees of freedom for each of the nine intercohort tables.
When we add these back in, we obtain the correct 24 degrees of freedom under
constant association when level 5 is ignored.

Alternatively, we may obtain the correct degrees of freedom by enumerating
the number of parameters fitted to the truncated classification.
Ignoring the cells at level 5, consider the design in Table 2. For each
cohort there are two separate 2 x 2 subtables which have neither rows nor
columns in common. For each cohort it takes four degrees of freedom to fit
the lower right hand 2 x 2 subtable, which has, of course, just one degree of
freedom for interaction. We use four degrees of freedom, also, in fitting
the upper left hand 2 x 2 subtable. Since three level parameters appear in
the upper left hand subtable, it may appear that the model for that subtable
is underidentified. That is not the case because one of the interaction
parameters (from level 4) is determined in the lower right hand 2 x 2
subtable. In summary, I fit each truncated cohort table exactly (exhausting
the eight degrees of freedom of the 8 counts) with a model in which four
interaction levels appear in the two 2 x 2 subtables. This explains why there
are no degrees of freedom for error when I fit the model of variable
association to the classification from which level 5 has been deleted. When I
fit only one set of level parameters to the truncated classification under the
model of constant interaction, I have (4 - 1) x 8 = 24 degrees of freedom
for error, which is the same result obtained above by subtraction.
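
The arithmetic in the last three paragraphs can be laid out step by step. The sketch below only restates the bookkeeping given in the text; it does not fit any model. The variable names are hypothetical, the full-table counts use the usual rule of cells minus independent fitted constants, and reading the factor 8 in "(4 - 1) x 8" as L - 1 is an assumption on my part.

```python
I, J = 5, 5   # rows and columns of each mobility table
K = 5         # levels in the design of Table 2
L = 9         # cohort tables being compared

cells = L * I * J                      # counts in the full classification
marginal_constants = L * (I + J - 1)   # row and column margins fitted per cohort

# Full classification: one shared set of K - 1 level parameters (constant
# association) versus a separate set for every cohort (variable association).
df_constant_full = cells - marginal_constants - (K - 1)
df_variable_full = cells - marginal_constants - L * (K - 1)

# Level 5 ignored, naive rule: subtract every level-5 count but one.
cells_at_level_5 = 17
df_naive = df_constant_full - (cells_at_level_5 * L - 1)

# Correction described in the text: four degrees of freedom per cohort table
# were subtracted in error and are added back.
df_corrected = df_naive + 4 * L

# Enumeration route described in the text: (4 - 1) x 8.
df_enumerated = (4 - 1) * (L - 1)

print(df_constant_full, df_variable_full)      # 140 108
print(df_naive, df_corrected, df_enumerated)   # -12 24 24
```

The two routes described in the text agree at 24, and the full-table values of 140 and 108 match the degrees of freedom quoted for the models of constant and of variable association.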

In the second part of Table 22 I show values of $G^2$ and degrees of freedom
under the model of variable association as each level of the design (Table 2)
is ignored in turn. In the full table the model of variable association
leaves 108 degrees of freedom for error. Since levels 1, 2, and 3 of the
design each include only 1 count per cohort, the fit does not improve, nor
do the degrees of freedom change, when I ignore the cells at those levels
under the model of variable association. Since level 4 has five
(independently variable) counts in each of the nine cohorts, and one degree
of freedom is used at this level for each cohort by the model of variable
association, I lose four degrees of freedom for each cohort when I ignore
level 4 in fitting the model of variable association; this leaves 72 degrees
of freedom for error. As explained above, the degrees of freedom are exhausted
when I ignore level 5 in fitting the model of variable association.

Recall from Figure 3 that the contrasts between the test statistic for
the full table under the model of variable association ($G^2$ = 175.6 with 108
df) and the test statistics below it in the second panel of Table 22 are of
the form, $G^2_b - G^2_d$. That is, they reflect lack of fit at each level of
the design. Since there is only one count at each of levels 1, 2, and 3, there
is no lack of fit at these levels under the model of variable association;
this is, of course, a formal property of the design in Table 2, and it has
no confirmatory value. When I compare the test statistics in Lines 1 and 4,
I obtain $G^2$ = 34.8 with 36 degrees of freedom. Since the test statistic is
less than its expected value, I infer that the fit is satisfactory at level
4 of the design. There are no degrees of freedom for error when I ignore
level 5 of the design, so I may attribute all of the lack of fit ($G^2$ = 175.6
with 108 df) to level 5. Obviously, this test statistic is significant, and
the result suggests I might refine the present design by splitting level 5
in Table 2 into two or more levels. Note these tests for lack of fit (and
the companion tests for change reported later) are not independent; there is
no inconsistency between my use of the overall test statistic for error to
evaluate the fit at level 5 and a component of the same statistic to
evaluate the fit at level 4.
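
The two judgments just made (34.8 on 36 df is unremarkable, 175.6 on 108 df is not) can be checked against the chi-square distribution. The text does not say how probability levels were obtained, so the sketch below substitutes the Wilson-Hilferty cube-root normal approximation, which needs only the standard library; the function name is hypothetical and the results are approximate.

```python
import math


def chi2_upper_tail(g2, df):
    """Approximate P(chi-square with df degrees of freedom >= g2).

    Uses the Wilson-Hilferty approximation: (X / df) ** (1/3) is treated as
    normal with mean 1 - 2 / (9 df) and variance 2 / (9 df).
    """
    variance = 2.0 / (9.0 * df)
    z = ((g2 / df) ** (1.0 / 3.0) - (1.0 - variance)) / math.sqrt(variance)
    # Upper-tail probability of a standard normal deviate z.
    return 0.5 * math.erfc(z / math.sqrt(2.0))


p_level_4 = chi2_upper_tail(34.8, 36)      # lack of fit at level 4
p_level_5 = chi2_upper_tail(175.6, 108)    # lack of fit attributed to level 5

print(round(p_level_4, 2))
print(p_level_5 < 0.001)
```

A chi-square variate has expectation equal to its degrees of freedom, which is why a statistic of 34.8 on 36 df gives a probability near one half, while 175.6 on 108 df lies far in the upper tail.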

In the third panel of Table 22 I report on each line the differences
between the entries on that line in the first and second panels. On the
first line the difference is a test statistic of the form, $G^2_a - G^2_b$,
and on the remaining lines the differences are test statistics of the form,
$G^2_c - G^2_d$. On the first line I obtain the overall test statistic for
change in level parameters, and the remaining lines give the test statistics
for change at all but the specified level of the design matrix. Since the full
cross-classification has K = 5 levels of interaction, and I compare L = 9
cohort tables, each of the latter test statistics has (L - 1)(K - 2) = 8 x 3
= 24 degrees of freedom. To obtain the test statistics for change at each
level of the design, I again take differences between the test statistic for
the full table and that on each line below it (recall Line 5 of Figure 3).
These are reported in the fourth panel of Table 22. Intercohort changes in
parameters at levels 1 and 3 are plainly not significant, while that at level
2 is (on my reading) of borderline statistical significance. At levels 4
and 5 there is statistically significant change in the parameters, whether
I evaluate the nominal probabilities associated with the values of $G^2$ or
use an appropriate simultaneous inferential procedure (Goodman 1969a).
Again, these tests for change are not statistically independent; note that
the sum of the test statistics for the five tests is larger than the test
statistic for the global contrast (59.7 with 32 df), and the sum of degrees
of freedom in the five tests is 40, rather than the 32 degrees of freedom for
change in the level parameters.

^1 I assume the familiarity of the reader with loglinear models for
frequency data. Fienberg (1970a, 1977), Goodman (1972a, 1972b) and Davis
(1974) give useful introductions, as does the comprehensive treatise by
Bishop, Fienberg and Holland (1975). I rely heavily on methods for the
analysis of incomplete tables, which have been developed by Goodman (1963,
1965, 1968, 1969a, 1969b, 1971, 1972c), Bishop and Fienberg (1969), Fienberg
(1970b, 1972), and Mantel (1970); again, Bishop, Fienberg and Holland
(1975, especially pp. 206-211, 225-228, 282-309, 320-324) is valuable.
Applications of loglinear models to occupational mobility data include
several of the papers by Goodman just cited and, also, Hope (1974, 1980),
Hauser et al. (1975b), Pullum (1975), Iutaka et al. (1975), Featherman,
Jones and Hauser (1975), Ramsey (1977), Hauser and Featherman (1977), Baron
(1980), Hauser (1978), Goldthorpe, Payne and Llewellyn (1978), Goldthorpe
and Payne (1980), Duncan (1979), and Featherman and Hauser (1978).

^2 In the I x J cross-classification there are (I - 1)(J - 1) degrees
of freedom for two-way interaction. The conventional structural model yields
two-way interaction effects for each of I x J counts by constraining the
product of interaction effects within each row and within each column of
the table; these constraints identify (I - 1)(J - 1) independent interaction
effects. Instead, the model of equation 1 identifies the two-way
interaction effects by constraining some of them to be equal across cells
of the classification.

^3 Many of these models, as well as problems in comparing their goodness
of fit, are reviewed by Bishop, Fienberg and Holland (1975: Chs. 5, 8, 9),
and some of the same models are discussed by Haberman (1974: Ch. 6). Duncan
(1979) and Goodman (1979b) have recently proposed additional models for
classifications of ordered categorical data.

^4 I denote parameters of the loglinear model in equation 2 by u*,
etc., in order to distinguish them from the u-terms of the conventional row-
by-column parameterization (Bishop, Fienberg, and Holland 1975: Ch. 2).

^5 The reported frequencies are based on a complex sampling design and
have been weighted to estimate population counts while compensating for
certain types of survey nonresponse. The estimated population counts have
been scaled down to reflect underlying sample frequencies, and a further
downward adjustment was made to compensate for departures of the sampling
design from simple random sampling. The frequency estimates in Table 1
have been rounded to the nearest integer, but my computations are based
on unrounded figures. I treat the adjusted frequencies as if they had
been obtained under simple random sampling (Featherman and Hauser 1978: App. B).

Occupation, industry and class of worker were coded in detail using
classification methods of the 1960 U.S. Census, and the detailed occupation
codes were aggregated to form the broad groups shown in the table.
The broad occupation groups have been defined in a slightly unconventional
way (on the basis of data on the schooling and incomes of current
occupational incumbents). Sales workers other than retail sales workers have
been placed in the upper nonmanual group, while proprietors have been placed
in the lower nonmanual group. This does not substantially affect the
findings relative to those based on a more conventional classification of
upper and lower white-collar jobs (Featherman and Hauser, 1978:100-184).

^6 This observation is elaborated by Featherman and Hauser (1978:177-100).

^7 Unsubscripted e is the base of natural logarithms and should not be
confused with the sample residuals in the multiplicative model, $e_{ij}$.

^8 Larntz (1978) has shown that $z_{ij}$ has better small-sample properties
than do components of $G^2$ (the likelihood-ratio test statistic) or Freeman-
Tukey deviates.

^9 See Featherman and Hauser (1978:66-67) for further discussion of these
patterns.

^10 Just as I have shown that parameters of the saturated model are mixtures
of parameters of my model, it is also easy to show that the Mosteller
adjustments yield indexes which are mixtures of the parameters of my model.

^11 In more complex models other methods may be needed to estimate
the missing frequencies, such as those used to estimate parameters for
models of the full table. The manual computations are often convenient,
and ECTA converges more rapidly when cells with unique parameters are
ignored than when the program is forced to fit them exactly.

^12 It took the author about an hour to prepare this illustration using a
small electronic calculator.

^13 This example was developed by Brian Clarridge. Both the subsamples
of respondents and of their siblings were highly stratified by sex and
educational attainment, but I have treated the data as if they were
obtained by simple random sampling.

^14 This example was developed with the assistance of Shu-Ling Tsai.
The Detroit data were collected by Edward Laumann; I estimated the counts
from percentages and marginal frequencies in a secondary analysis of these
data by R. Jackson (1977). For this reason the counts in Panel A may differ
slightly from those in the original data.

^15 There are methodological differences in the measurement of first
occupations in these two surveys, and for this reason it would not be
surprising if the two models (or their parameters) differed substantially.
For more details about this comparison see Featherman and Hauser (1978:
200-208).

^16 The degrees of freedom of the statistics $G^2_c$ and $G^2_d$ may be
difficult to enumerate because they depend on the pattern of the design matrix
as well as on the number of cells at the kth level. For example, when level
5 of the design in Table 2 is blanked out, we delete only 4 independent
entries in each of the third row and the third column, and we no longer need
to estimate marginal effects of the third row or the third column of the
mobility classification. Thus, one is well-advised to apply the general
rule that the degrees of freedom for error under a model is equal to the
number of cells in the classification less the number of independent
constants fitted to it.

^17 See Featherman and Hauser (1978:200-208) for further discussion of
these results.

TABLE 1

Counts in a Classification of Mobility from Father's (or Other Family
Head's) Occupation to Son's First Full-time Civilian Occupation: U.S. Men
Aged 20-64 in 1973

                                     Son's occupation
Father's              Upper       Lower       Upper    Lower
occupation            Nonmanual   Nonmanual   Manual   Manual    Farm     Total

Upper Nonmanual         1414         521        302      643       40      2920
Lower Nonmanual          724         524        254      703       48      2253
Upper Manual             798         648        856     1676      108      4086
Lower Manual             756         914        771     3325      237      6003
Farm                     409         357        441     1611     1832      4650

Total                   4101        2964       2624     7958     2265    19,912

NOTE: Counts are based on observations weighted to estimate
population counts and compensate for departures of the
sampling design from simple random sampling. Broad occupation
groups are upper nonmanual: professional and kindred
workers, managers and officials, and non-retail sales
workers; lower nonmanual: proprietors, clerical and kindred
workers, and retail sales workers; upper manual: craftsmen,
foremen and kindred workers; lower manual: service workers,
operatives and kindred workers, and laborers, except farm;
farm: farmers and farm managers, farm laborers and foremen.

TABLE 2

Asymmetric Five-level Model of Mobility from Father's Occupation
to First Full-time Civilian Occupation

                              Son's occupation
Father's
occupation              1      2      3      4      5

1. Upper Nonmanual      2      4      5      5      5
2. Lower Nonmanual      3      4      5      5      5
3. Upper Manual         5      5      5      5      5
4. Lower Manual         5      5      5      4      4
5. Farm                 5      5      5      4      1

NOTE: Broad occupation groups are upper nonmanual:
professional and kindred workers, managers and
officials, and non-retail sales workers; lower
nonmanual: proprietors, clerical and kindred
workers, and retail sales workers; upper manual:
craftsmen, foremen and kindred workers; lower
manual: service workers, operatives and kindred
workers, and laborers, except farm; farm: farmers
and farm managers, farm laborers and foremen.
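
Because Tables 1 and 2 drive every later computation, it is worth checking the transcription against the printed margins. The sketch below is a consistency check, not part of the original analysis; the variable names are hypothetical, and the count for upper manual fathers and upper manual sons is taken as 856, the value that agrees with both the printed row total (4086) and column total (2624).

```python
from collections import Counter

# Table 1: rows are father's occupation, columns are son's first occupation.
counts = [
    [1414, 521, 302, 643, 40],
    [724, 524, 254, 703, 48],
    [798, 648, 856, 1676, 108],
    [756, 914, 771, 3325, 237],
    [409, 357, 441, 1611, 1832],
]

# Table 2: the level of the design assigned to each cell.
design = [
    [2, 4, 5, 5, 5],
    [3, 4, 5, 5, 5],
    [5, 5, 5, 5, 5],
    [5, 5, 5, 4, 4],
    [5, 5, 5, 4, 1],
]

row_totals = [sum(row) for row in counts]
col_totals = [sum(col) for col in zip(*counts)]
grand_total = sum(row_totals)

# Number of cells assigned to each level of the design.
cells_per_level = Counter(level for row in design for level in row)

print(row_totals)    # [2920, 2253, 4086, 6003, 4650]
print(col_totals)    # [4101, 2964, 2624, 7958, 2265]
print(grand_total)   # 19912
print(sorted(cells_per_level.items()))  # [(1, 1), (2, 1), (3, 1), (4, 5), (5, 17)]
```

The cell counts per level (1, 1, 1, 5, and 17) are the values of $M_k$ used in the degrees-of-freedom calculations for the intercohort comparison.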



TABLE 3

Estimated Parameters of the Model of Table 2: Mobility from Father's
(or Other Family Head's) Occupation to Son's First Full-time Civilian
Occupation, U.S. Men Aged 20-64 in 1973

                                  Category of row, column, or level
Design factor                     (1)       (2)       (3)       (4)       (5)

Rows (father's occupation)       -.466     -.451      .495      .570     -.148
Columns (son's occupation)        .209      .190      .240     1.020    -1.660
Levels (density)                 3.044     1.234      .549      .243     -.356

Grand mean = 6.277

TABLE 4

Log of Ratio of Observed to Expected Frequencies in the Model of Table 2:
Mobility from Father's (or Other Family Head's) Occupation to Son's First
Full-time Civilian Occupation, U.S. Men Aged 20-64 in 1973

                              Son's occupation
Father's
occupation              1       2       3       4       5

1. Upper Nonmanual     .00*    .01     .02    -.01    -.10
2. Lower Nonmanual     .00*    .00    -.17     .06     .06
3. Upper Manual        .06    -.13     .10    -.01    -.07
4. Lower Manual       -.07     .14    -.08     .00     .04
5. Farm                .03    -.09     .08    -.01     .00*

*Fitted exactly under the model.

TABLE 5

Standardized Residuals from the Model of Table 2: Mobility from Father's
(or Other Family Head's) Occupation to Son's First Full-time Civilian
Occupation, U.S. Men Aged 20-64 in 1973

                              Son's occupation
Father's
occupation              1       2       3       4       5

1. Upper Nonmanual     .00*    .26     .29    -.24    -.60
2. Lower Nonmanual     .00*    .04   -2.76    1.71     .45
3. Upper Manual       1.60   -3.38    2.6?    -.51    -.74
4. Lower Manual        ?      4.14   -2.28    -.05     .60
5. Farm                .65   -1.65    1.58    -.32     .00*

*Fitted exactly under the model.

TABLE 6

New Mobility Ratios ($R^*_{ij}$) Under the Model of Table 2: Mobility from
Father's (or other Family Head's) Occupation to Son's First Full-time
Civilian Occupation, U.S. Men Aged 20-64 in 1973

                              Son's occupation
Father's
occupation              1       2       3       4       5

1. Upper Nonmanual     3.42    1.20     .71     .70     .64
2. Lower Nonmanual     1.73    1.26     .59     .75     .75
3. Upper Manual         .74     .61     .77     .69     .65
4. Lower Manual         .65     .80     .64    1.27    1.32
5. Farm                 .73     .64     .76    1.27   20.91
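
The parameter estimates of Table 3 can be turned back into expected counts. The sketch below assumes the additive log form implied by the layout of that table (grand mean plus a row effect, a column effect, and the effect of the level assigned to the cell by Table 2); the function name is hypothetical, and the level-2 estimate is read as 1.234. Because the published estimates are rounded to three decimals, the reconstructed counts are approximate.

```python
import math

grand_mean = 6.277
row_effects = [-0.466, -0.451, 0.495, 0.570, -0.148]   # father's occupation
col_effects = [0.209, 0.190, 0.240, 1.020, -1.660]     # son's occupation
level_effects = {1: 3.044, 2: 1.234, 3: 0.549, 4: 0.243, 5: -0.356}

# Design of Table 2: level assigned to each cell.
design = [
    [2, 4, 5, 5, 5],
    [3, 4, 5, 5, 5],
    [5, 5, 5, 5, 5],
    [5, 5, 5, 4, 4],
    [5, 5, 5, 4, 1],
]


def expected_count(i, j):
    """Expected frequency for row i, column j (zero-based indices)."""
    level = design[i][j]
    log_count = grand_mean + row_effects[i] + col_effects[j] + level_effects[level]
    return math.exp(log_count)


# Cells that sit alone at their level are fitted exactly, so the expected
# count should reproduce the observed count up to rounding of the estimates.
print(round(expected_count(0, 0)))   # observed count is 1414
print(round(expected_count(4, 4)))   # observed count is 1832

# Log ratio of observed to expected for upper manual fathers, lower
# nonmanual sons (observed count 648); Table 4 reports -.13.
print(round(math.log(648 / expected_count(2, 1)), 2))
```

Under these assumptions the three single-cell levels return the observed counts to within rounding, and the log ratio for the cell with observed count 648 agrees with the corresponding entry of Table 4.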






TABLE 7

Standard Outflow and Inflow Tables: Mobility from Father's (or other
Family Head's) Occupation to Son's First Full-time Civilian Occupation,
U.S. Men Aged 20-64 in 1973

                                   Son's occupation
Father's
occupation              (1)      (2)      (3)      (4)      (5)     Total

Outflow
1. Upper Nonmanual      48.4     17.8     10.3     22.0      1.4    100.0
2. Lower Nonmanual      32.1     23.3     11.3     31.2      2.1    100.0
3. Upper Manual         19.5     15.9     20.9     41.0      2.6    100.0
4. Lower Manual         12.6     15.2     12.8     55.4      4.0    100.0
5. Farm                  8.8      7.7      9.5     34.6     39.4    100.0
Total                   20.6     14.9     13.2     40.0     11.4    100.0

Inflow
1. Upper Nonmanual      34.5     17.6     11.5      8.1      1.8     14.7
2. Lower Nonmanual      17.7     17.7      9.7      8.8      2.1     11.3
3. Upper Manual         19.5     21.9     32.6     21.1      4.8     20.5
4. Lower Manual         18.4     30.8     29.4     41.8     10.5     30.1
5. Farm                 10.0     12.0     16.8     20.2     80.9     23.4
Total                  100.0    100.0    100.0    100.0    100.0    100.0

NOTE: Frequencies are based on observations weighted to estimate
population counts and compensate for departures of the
sampling design from simple random sampling. Broad occupation
groups are upper nonmanual: professional and kindred
workers, managers and officials, and non-retail sales
workers; lower nonmanual: proprietors, clerical and kindred
workers, and retail sales workers; upper manual: craftsmen,
foremen and kindred workers; lower manual: service workers,
operatives and kindred workers, and laborers, except farm;
farm: farmers and farm managers, farm laborers and foremen.




TABLE 8

Old Mobility Ratios ($R_{ij}$) under the Model of Simple Independence:
Mobility from Father's (or Other Family Head's) Occupation to Son's
First Full-time Civilian Occupation, U.S. Men Aged 20-64 in 1973

                              Son's occupation
Father's
occupation              1       2       3       4       5

1. Upper Nonmanual     2.35    1.20     .78     .55     .12
2. Lower Nonmanual     1.56    1.56     .86     .78     .19
3. Upper Manual         .95    1.06    1.59    1.03     .23
4. Lower Manual         .61    1.02     .98    1.39     .35
5. Farm                 .43     .52     .72     .87    3.46
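
The ratios in this table follow directly from the counts of Table 1: each observed count is divided by the count expected under independence of rows and columns. The sketch below recomputes them from the rounded counts; the function name is hypothetical, and the count for upper manual fathers and sons is taken as 856, in agreement with the printed margins. Small discrepancies in the second decimal are possible because the original computations used unrounded frequencies.

```python
counts = [
    [1414, 521, 302, 643, 40],
    [724, 524, 254, 703, 48],
    [798, 648, 856, 1676, 108],
    [756, 914, 771, 3325, 237],
    [409, 357, 441, 1611, 1832],
]


def mobility_ratios(table):
    """Observed count over the count expected under simple independence."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    return [
        [table[i][j] * total / (row_totals[i] * col_totals[j])
         for j in range(len(col_totals))]
        for i in range(len(row_totals))
    ]


ratios = mobility_ratios(counts)
for row in ratios:
    print(" ".join(f"{value:5.2f}" for value in row))
```

The diagonal entries stand out (2.35 for upper nonmanual, 3.46 for farm), which is the pattern of immobility these ratios are conventionally used to display.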



TABLE 9

Raw and Adjusted Standardized Deviates under the Model of Simple Independence:
Mobility from Father's (or Other Family Head's) Occupation to Son's First
Full-time Civilian Occupation, U.S. Men Aged 20-64 in 1973

                                   Son's occupation
Father's
occupation               (1)       (2)       (3)       (4)       (5)

Standardized deviates
1. Upper Nonmanual      33.13      4.14     -4.23    -15.33    -16.02
2. Lower Nonmanual      12.07     10.31     -2.49     -6.60    -12.99
3. Upper Manual         -1.50      1.60     13.68      1.08    -16.5?
4. Lower Manual        -13.65       .68      -.70     18.89    -17.07
5. Farm                -17.73    -12.74     -6.94     -5.74     56.65

Adjusted standardized deviates
1. Upper Nonmanual      48.89      5.70     -5.71    -29.93    -21.18
2. Lower Nonmanual      17.14     13.66     -3.24    -12.39    -16.52
3. Upper Manual         -2.38      2.37     19.82      2.27    -23.52
4. Lower Manual        -24.61      1.15     -1.15     45.05    -27.57
5. Farm                -29.14    -19.53    -10.42    -12.47     83.40
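
The raw standardized deviates in the upper panel can be recomputed from Table 1 as (observed - expected) / sqrt(expected), with expected counts taken from the model of simple independence. The sketch below covers only that upper panel; the function name is hypothetical, and values may differ slightly in the second decimal from the printed ones because the original computations used unrounded frequencies.

```python
import math

counts = [
    [1414, 521, 302, 643, 40],
    [724, 524, 254, 703, 48],
    [798, 648, 856, 1676, 108],
    [756, 914, 771, 3325, 237],
    [409, 357, 441, 1611, 1832],
]


def standardized_deviates(table):
    """(observed - expected) / sqrt(expected) under simple independence."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    deviates = []
    for i, row in enumerate(table):
        out = []
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            out.append((observed - expected) / math.sqrt(expected))
        deviates.append(out)
    return deviates


deviates = standardized_deviates(counts)
for row in deviates:
    print(" ".join(f"{value:7.2f}" for value in row))
```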



TABLE 10

Multiplicative (τ) Parameters in a Saturated Model of Mobility from
Father's (or Other Family Head's) Occupation to Son's First Full-time
Civilian Occupation, U.S. Men Aged 20-64 in 1973

                              Son's occupation
Father's
occupation              1       2       3       4       5

1. Upper Nonmanual     2.76    1.37     .96     .72     .38
2. Lower Nonmanual     1.58    1.54     .90     .88     .52
3. Upper Manual         .92    1.00    1.60    1.11     .61
4. Lower Manual         .67    1.02    1.04    1.53     .96
5. Farm                 .40     .47     .70     .90    8.67
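
Interaction parameters of a saturated model under the conventional row-by-column parameterization can be obtained by double-centering the logged counts and exponentiating. The sketch below applies that standard calculation to the rounded counts of Table 1; the function name is hypothetical, and whether the original table was produced in exactly this way is an assumption. Agreement with the printed values is close but not exact, which is consistent with the original use of unrounded frequencies.

```python
import math

counts = [
    [1414, 521, 302, 643, 40],
    [724, 524, 254, 703, 48],
    [798, 648, 856, 1676, 108],
    [756, 914, 771, 3325, 237],
    [409, 357, 441, 1611, 1832],
]


def saturated_interactions(table):
    """Multiplicative interactions: exp of double-centered log counts."""
    logs = [[math.log(value) for value in row] for row in table]
    n_rows, n_cols = len(logs), len(logs[0])
    row_means = [sum(row) / n_cols for row in logs]
    col_means = [sum(logs[i][j] for i in range(n_rows)) / n_rows
                 for j in range(n_cols)]
    grand_mean = sum(row_means) / n_rows
    return [
        [math.exp(logs[i][j] - row_means[i] - col_means[j] + grand_mean)
         for j in range(n_cols)]
        for i in range(n_rows)
    ]


tau = saturated_interactions(counts)
for row in tau:
    print(" ".join(f"{value:5.2f}" for value in row))
```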



TABLE 11

Raw and Multiplicatively Adjusted Frequencies in a Hypothetical Table

                              Columns
Rows           1                   2                   Total

Raw frequencies
1              x11                 x12                 x11 + x12
2              x21                 x22                 x21 + x22
Total          x11 + x21           x12 + x22           N

Adjusted frequencies
1              acx11               adx12               a(cx11 + dx12)
2              bcx21               bdx22               b(cx21 + dx22)
Total          c(ax11 + bx21)      d(ax12 + bx22)      ?

TABLE 12

Doubly Standardized Frequencies of Mobility from Father's (or Other Family
Head's) Occupation to Son's First Full-time Civilian Occupation, U.S. Men
Aged 20-64 in 1973

                                  Son's occupation
Father's
occupation              1       2       3       4       5      Total

1. Upper Nonmanual      ?      1.16     .87     .66     .21     5.00
2. Lower Nonmanual     1.35    1.50     .92     .91     .32     5.00
3. Upper Manual         .80    1.00    1.66    1.16     .39     5.00
4. Lower Manual         .56    1.0?    1.10    1.69     .62     5.00
5. Farm                 .22     .29     .45     .59    3.46     5.00
Total                  5.00    5.00    5.00    5.00    5.00    25.00

TABLE 13

Design Matrices for Three Models of Quasi-Independence in the 5 by 5
Mobility Table

                                   Son's occupation
         Father's          Partial table             Full table
Model    occupation     1   2   3   4   5        1    2    3    4    5

Q1       1              0   1   1   1   1        2    1    1    1    1
         2              1   0   1   1   1        1    3    1    1    1
         3              1   1   0   1   1        1    1    4    1    1
         4              1   1   1   0   1        1    1    1    5    1
         5              1   1   1   1   0        1    1    1    1    6

Q2       1              0   0   1   1   1        2    3    1    1    1
         2              0   0   1   1   1        4    5    1    1    1
         3              1   1   0   1   1        1    1    6    1    1
         4              1   1   1   0   0        1    1    1    7    8
         5              1   1   1   0   0        1    1    1    9   10

Q3       1              0   0   1   1   1        2    3    1    1    1
         2              0   0   0   1   1        4    5    6    1    1
         3              1   0   0   0   1        1    7    8    9    1
         4              1   1   0   0   0        1    1   10   11   12
         5              1   1   1   0   0        1    1    1   13   14

TABLE 14

Summary of Fit of Selected Multiplicative Models: Mobility from Father's
(or Other Family Head's) Occupation to Son's First Full-time Civilian
Occupation, U.S. Men Aged 20-64 in 1973

Model                                  N*      $G^2$     df

Independence                         19,913   6167.69    16    20.1   100.0
Q1 - Main diagonal blocked           11,963    6??.06    11     5.5    11.1
Q2 - Diagonal and intra-stratum
     moves blocked                    8,869     50.05     7     1.4     0.8
Q3 - Diagonal and inner diagonals
     blocked                          5,520     15.67     3     0.6     0.3

*Sum of frequencies excluding those fitted exactly
under the model.
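
Model Q1 blocks the five diagonal cells and fits independence to the remaining twenty. One standard way to obtain its expected frequencies is iterative proportional fitting with the blocked cells held at zero; the sketch below uses that algorithm as an illustration and does not claim to reproduce the author's own computing procedure. The function name and the number of sweeps are hypothetical choices, and the counts are the rounded values of Table 1, so results agree with the published expected frequencies for Q1 only approximately.

```python
counts = [
    [1414, 521, 302, 643, 40],
    [724, 524, 254, 703, 48],
    [798, 648, 856, 1676, 108],
    [756, 914, 771, 3325, 237],
    [409, 357, 441, 1611, 1832],
]

# Model Q1: the main diagonal is blocked.
blocked = {(i, i) for i in range(5)}


def fit_quasi_independence(table, blocked_cells, sweeps=200):
    """Fit independence to the unblocked cells by iterative proportional fitting."""
    size = len(table)
    free = [[(i, j) not in blocked_cells for j in range(size)] for i in range(size)]
    fitted = [[1.0 if free[i][j] else 0.0 for j in range(size)] for i in range(size)]
    for _ in range(sweeps):
        # Scale each row to the observed total of its unblocked cells.
        for i in range(size):
            target = sum(table[i][j] for j in range(size) if free[i][j])
            scale = target / sum(fitted[i])
            fitted[i] = [value * scale for value in fitted[i]]
        # Scale each column to the observed total of its unblocked cells.
        for j in range(size):
            target = sum(table[i][j] for i in range(size) if free[i][j])
            scale = target / sum(fitted[i][j] for i in range(size))
            for i in range(size):
                fitted[i][j] *= scale
    return fitted


fitted = fit_quasi_independence(counts, blocked)

# Degrees of freedom: (I - 1)(J - 1) less one for each blocked cell.
df_q1 = (5 - 1) * (5 - 1) - len(blocked)
# Frequencies remaining after the blocked cells are set aside.
n_q1 = sum(counts[i][j] for i in range(5) for j in range(5) if (i, j) not in blocked)

print(df_q1, n_q1)
for row in fitted:
    print(" ".join(f"{value:7.1f}" for value in row))
```

The degrees of freedom (11) match the entry for Q1 in the summary of fit, and the remaining frequency of 11,961 from the rounded counts is within rounding of the 11,963 reported there.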



ThBlt lb 



Ratios of Observed Frequencies to EstlMted rrequencles at Quasi- 
XfMlependent Level Utnlcr Three Models of Quasl-Independences Mobility 
frosi rattler's (or Oiticr faaliy Head's) Occu|Mitloa to Son's First 
rull-tl rivMiAn Ocru|>atlon, U.S. Men Aqed 20-64 In 1973 



*"-n*» occupation 





Father's 

occupation 


1 


2 


3 


4 


5 


Ql 


1. 


Upper nonauinual 




1.51 


1.06 


./9 


.62 


2. 


Lower nonaanual 


l.?3 


1.35* 


.79 


.77 


.66 




3. 


Upper MiiuaX 


1.06 


.93 


1.48*- 


1,02 


JBT 




4. 


Lower manual 




1.06 


1.08 


1.63* 


1.45 




5. 


Farm 


.71 


.67 


.99 


1.28 


18.13* 


fta 


I. 


Upper noHAanual 


4.94* 


1.86* 


1.08 


.97 


.93 


2. 


Lowerngniaanual 


2.48» 


1.84* 


.89 


l;04 


1.10 




3. 


Upper atanual 


1. 10 


.91 


1.21- 


I. CO 


.98 




4. 


Lower sunual 


.91 


1.13 


.96 


1.74* 


1.90* 




%. 


Far* 


l.OO 


.89 


1.11 


1.70* 


29,77* 




1. 


Upper m>naianual 


5.33* 


1.81* 


.98 


1.01 


1.04 


2^ 


UMec noiwanuaX 




U64* 


.74* 




X.13 




3. 


tipper iMnual 


1. 01 


.76* 


.9 3* 


,88* 


.94 




- 4^- 


Lower^ manual 


.94 


1.05 


.82* 


1.73* 


2.03* 


1 


5. 


F«r» 


1.10 


.89 


1.02 


1.81* 


33.92* 



*C«11« l9nor«4 (or (ittad exactly) under th« aod.!. 
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TAULE 10 



Ex|>ected Frequencies Under Model Ql: Mobility fro» PaU»er*s (or othor 
Favily Head's) Occupation to Son's First Full-tiwe Civilian Occup«»tlon, 
U.S. Men Aged 20-64 In 1973 



Father's                              Son's occupation
occupation                    1          2          3          4          5

1. Upper nonmanual          372.3*     344.1      285.6      811.5       65.1
2. Lower nonmanual          419.5      387.7*     321.8      914.5       73.3
3. Upper manual             754.9      697.7      579.0*    1645.5      131.9
4. Lower manual             934.8      864.0      717.0     2037.7*     163.3
5. Farm                     578.6      534.8      443.8     1261.2      101.1*

*Cells ignored (or fitted exactly) under the model.
TABLE 17



Manual Fit of Model Q3 to Table 1



Son's occupation



Father's occupation



Effect



A. Log of frequency in fitted cells



Row mean



1. 


Upper nonmanual












5.291


2.


Lower nonmanual








6.5S5- 


3.M^ 


5.218


3. 


Upper manual


6.682 




— 




4.680 


5.681


4.


Lower manual


O » 


o • oxa 








6.724


5.


Farm




9. a «e 


e .099 






5.994 






B. Data less row means








1.


Upper nonmanual





.4l2P 




• 1 *^i%5 




2. 


Lower nonmanual


- 






X.337 


-X.337 




3.


Upper manual


x.oox 




— 


— 


-X.OOX 




4.


Lower manual














5.


Farm













Column mean






- Oil 


• «9 # 




. Ill 








C. Data less column means








1.


Upper nonmanual






1 £9 

• io« 


_ AMI 

— .UaX 


— .284 


-.081 


2. 


Lower nonmanual


• 






.09X 


*.026 


.028 


3.


Upper manual


.692 




— 


— 


• 3X0 


.50X 


4.


Lower manual


•^.404 










— «yl5(L 


5.


Farm


••209 


-. X05 


— .Io2 












D. Data less row means








1.


Upper nonmanual






f«ej 


• 000 


— .203 




2. 


Lower nonmanual


- 


- 




!o53 


-.054 






Upper manual


• ISX 




— 




-.X9X 




4.


Lower manual


_ •*c^ 
" .254 












5.


Farm


••147 


. 09 # 
















1 ca 


. i«« 


. W« / 


^191 








E. Data less column medians



Row median


1. 


Upper nonmanual






.12X 


— . w27 


— . 01« 


— t. 


2. 


Lower nonmanual








.026 


.X3T 


.082 


3. 


Upper manual


.:)xa 








0 


.XS9 


4.


Lower manual


-.127 


.099 






r • 




5.


Farm


0 


-.099 


-.X23 






« ooa 






F. Data less row medians






1.


Upper nonmanual






.X33 


— . Ol9 






2. 


Lower nonmanual








-.Q54 


.055 






Upper manual


.159 






— 


-.X59. 




4.


Lower manual


•.XX3 


.XX3 










5.


Farm


.099 


.000 


-.023 








Column median


.099 


.os^ 


.055 


_ ma 

— . U J9 


.WW 








G. Data less column medians



Row median


1. 


Upper nonmanual






.079 


h02X 


f ooo 


. 0^1 


2.


Lower nonmanual








-.020 ' 


.059' 


.0X8" 


3. 


Upper manual


.060 






— 


-.X59 


-.050 


4. 


Lower manual


-.n: 


.057 








• 07e 


5.


Farm




-.054 


-.079 






— .09% 






H. Data less row medians








1.


Upper nonmanual






.057 


.000 


-.02X 




2. 


Lower nonmanual








-.039 


.037 




3. 


Upper manual


.110 








-.X09 




4. 


Lower manual


•.X34 


.X33 






m 




5.


Farm


•.056 


.000 


-.022 








Column median


-.OW 


.049 


.0X8 


-.0X9 


-.02X 








I. Data less column medians



Row median


1.


Upper nonmanual






.039 


.0X9 


.000 


.019 


2. 


Lower nonmanual








-.0X9 


.058 


.0^0 


3. 


Upper manual


.054 








-.088 


-.017 


4. 


Lower manual


-.190 


.047 




• 




-.062 


5.


Farm


0 


*.04t 


-.040 






-.040 



NOTE: See text for explanation.



TABLE 18 



Cumulation of Effects from Table 17



Occupation



Panel of Table 17 



B 



H 



Row effects



Total 



1. Upper nonmanual       5.291     -.081      .012      .021
2. Lower nonmanual       5.218      .028      .082      .018
3. Upper manual          5.681      .501      .159      .050
4. Lower manual          6.724     -.150      .014      .078
5. Farm                  5.994     -.162      .099      .056



Column effects

1. Upper nonmanual        .309     -.127      .099      .066
2. Lower nonmanual       -.011      .156      .056      .068
3. Upper manual           .257      .122      .055      .018
4. Lower manual          1.256      .027      .036      .019
5. Farm                 -1.311     -.191      .000      .021
TABLE 19



Summary Analysis of Model Q3 Fitted by Medians



Father's                              Son's occupation
occupation                    1          2          3          4          5

A. Expected logs of frequencies in cells ignored in fitting

1. Upper nonmanual          5.556      5.488        -          -          -
2. Lower nonmanual          5.683      5.615      5.798        -          -
3. Upper manual               -        6.560      6.743      7.519        -
4. Lower manual               -          -     [illegible]   7.710      4.959
5. Farm                       -          -          -        6.905      4.154

B. Observed less expected logs of frequencies

1. Upper nonmanual          1.698       .768       .039       .019       .000
2. Lower nonmanual           .902       .647      -.261      -.019       .058
3. Upper manual              .054      -.086       .009      -.095      -.088
4. Lower manual             -.190       .067      -.286       .399       .509
5. Farm                      .000      -.068      -.040       .480      3.359

C. Antilogs of entries in Panel B

1. Upper nonmanual          5.46       2.16       1.04       1.02       1.00
2. Lower nonmanual          2.46       1.91        .77        .98       1.06
3. Upper manual             1.06        .92       1.01        .91        .92
4. Lower manual              .83       1.07        .75       1.49       1.66
5. Farm                     1.00        .93        .96       1.62      28.76
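
Panel C is obtained from Panel B by taking antilogs, so each entry is a ratio of an observed frequency to the frequency expected under Model Q3. The following is a minimal sketch of that step in Python. The Panel B values are entered as read from the table (a few digits are hard to read in the source and are assumptions here), and the variable names are illustrative only.

```python
import math

# Panel B of Table 19: observed less expected logs of frequencies under Model Q3.
# Rows are father's occupation, columns are son's occupation (1-5).
log_residuals = [
    [1.698,  .768,  .039,  .019,  .000],
    [ .902,  .647, -.261, -.019,  .058],
    [ .054, -.086,  .009, -.095, -.088],
    [-.190,  .067, -.286,  .399,  .509],
    [ .000, -.068, -.040,  .480, 3.359],
]

# Panel C: antilogs of Panel B, i.e. observed / expected frequency in each cell.
ratios = [[round(math.exp(value), 2) for value in row] for row in log_residuals]

for row in ratios:
    print("  ".join(f"{value:6.2f}" for value in row))
```

A ratio above 1 marks a cell that is denser than the model predicts; the farm-to-farm cell stands out most sharply.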
TABLE 20



Respondent's Education (Years of School) by Sibling's Education:
Wisconsin Sample



Respondent's                    Sibling's Education
Education         12      13      14      15      16     17+   Total

A. Frequencies

12               881      51      53      27     109      58    1179
13                65       8       7       7      18       6     111
14                40       9      14       7      14      11      95
15                23       7       4       2       6       6      48
16                91      20      12       9      77      53     262
17+               57      14      13       4      59      51     198
Total           1157     109     103      56     283     185    1893




B. Row Percentages

12              74.7     4.3     4.5     2.3     9.2     4.9   100.0
13              58.6     7.2     6.3     6.3    16.2     5.4   100.0
14              42.1     9.5    14.7     7.4    14.7    11.6   100.0
15              47.9    14.6     8.3     4.2    12.5    12.5   100.0
16              34.7     7.6     4.6     3.4    29.4    20.2   100.0
17+             28.8     7.1     6.6     2.0    29.8    25.8   100.0
Total           61.1     5.8     5.4     3.0    14.9     9.8   100.0


C. Column Percentages

12              76.1    46.8    51.4    48.2    38.5    31.4    62.3
13               5.6     7.3     6.8    12.5     6.4     3.2     5.9
14               3.5     8.3    13.6    12.5     4.9     5.9     5.0
15               2.0     6.4     3.9     3.6     2.1     3.2     2.5
16               7.9    18.3    11.7    16.1    27.2    28.6    13.8
17+              4.9    12.8    12.6     7.1    20.8    27.6    10.5
Total          100.0   100.0   100.0   100.0   100.0   100.0   100.0


D. Estimated Frequencies Under Symmetry (G² = 11.59 with 15 df)

12               881      58    46.5      25     100    57.5    1168
13                58       8       8       7      19      10     110
14              46.5       8      14     5.5      13      12      99
15                25       7     5.5       2     7.5       5      52
16               100      19      13     7.5      77      56   272.5
17+             57.5      10      12       5      56      51   191.5
Total           1168     110      99      52   272.5   191.5    1893
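
The symmetry estimates in Panel D are the averages of each pair of cells on opposite sides of the main diagonal, and the likelihood-ratio statistic compares the observed counts with those averages. The sketch below, in Python, recomputes both from the Panel A frequencies. The variable names are illustrative assumptions, not notation from the paper.

```python
import math

# Panel A of Table 20: respondent's education (rows) by sibling's education (columns),
# categories 12, 13, 14, 15, 16, 17+ years of school.
observed = [
    [881, 51, 53, 27, 109, 58],
    [ 65,  8,  7,  7,  18,  6],
    [ 40,  9, 14,  7,  14, 11],
    [ 23,  7,  4,  2,   6,  6],
    [ 91, 20, 12,  9,  77, 53],
    [ 57, 14, 13,  4,  59, 51],
]
n = len(observed)

# Under symmetry the estimate for cell (i, j) is the mean of cells (i, j) and (j, i).
expected = [[(observed[i][j] + observed[j][i]) / 2 for j in range(n)]
            for i in range(n)]

# Likelihood-ratio statistic; diagonal cells are fitted exactly and contribute nothing.
g2 = 2 * sum(observed[i][j] * math.log(observed[i][j] / expected[i][j])
             for i in range(n) for j in range(n) if i != j)

# One degree of freedom for each pair of off-diagonal cells.
df = n * (n - 1) // 2

print(f"G2 = {g2:.2f} with {df} df")
```

With 15 degrees of freedom a statistic of this size gives no reason to reject symmetry, which is why the smoothed counts of Panel D can stand in for the raw counts.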


E. Row (or Column) Percentages Under Symmetry

12              75.4     5.0     4.0     2.1     8.6     4.9   100.0
13              52.7     7.3     7.3     6.4    17.3     9.1   100.0
14              47.0     8.1    14.1     5.6    13.1    12.1   100.0
15              48.0    13.5    10.6     3.8    14.4     9.6   100.0
16              36.7     7.0     4.8     2.8    28.3    20.6   100.0
17+             30.0     5.2     6.3     2.6    29.1    26.6   100.0
Total           61.7     5.8     5.2     2.7    14.4    10.1   100.0
TABLE 20 (continued)



Respondent's                    Sibling's Education
Education         12      13      14      15      16     17+   Total


F. Observed/Expected Frequencies Under Independence (G² = 333.7 with 25 df)

12              1.22     .75     .83     .77     .62     .50
13               .96    1.25    1.16    2.13    1.08     .55
14               .69    1.64    2.71    2.49     .99    1.18
15               .78    2.53    1.53    1.41     .84    1.28
16               .57    1.33     .84    1.16    1.97    2.07
17+              .47    1.23    1.21     .68    1.99    2.64
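
The ratios in Panel F compare each observed count with the count expected if respondent's and sibling's education were independent. As a hedged illustration, the Python sketch below recomputes those ratios and the likelihood-ratio statistic from the Panel A frequencies. All names in the code are assumptions chosen for the example.

```python
import math

# Panel A of Table 20: respondent's education (rows) by sibling's education (columns).
observed = [
    [881, 51, 53, 27, 109, 58],
    [ 65,  8,  7,  7,  18,  6],
    [ 40,  9, 14,  7,  14, 11],
    [ 23,  7,  4,  2,   6,  6],
    [ 91, 20, 12,  9,  77, 53],
    [ 57, 14, 13,  4,  59, 51],
]
n = len(observed)

row_totals = [sum(row) for row in observed]
col_totals = [sum(observed[i][j] for i in range(n)) for j in range(n)]
total = sum(row_totals)

# Expected count under independence: (row total * column total) / grand total.
expected = [[row_totals[i] * col_totals[j] / total for j in range(n)]
            for i in range(n)]

# Panel F: observed / expected.
ratios = [[observed[i][j] / expected[i][j] for j in range(n)] for i in range(n)]

# Likelihood-ratio statistic and its degrees of freedom.
g2 = 2 * sum(observed[i][j] * math.log(ratios[i][j])
             for i in range(n) for j in range(n))
df = (n - 1) ** 2

for row in ratios:
    print("  ".join(f"{value:5.2f}" for value in row))
print(f"G2 = {g2:.1f} with {df} df")
```

These ratios still mix the effects of the marginal distributions with the association itself, which is the confounding that the multiplicative model of Panels I and J is meant to remove.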




G. Multiplicative Parameters of the Saturated Row × Column Model

12              2.48     .83     .99     .91     .82     .66
13              1.33     .95     .95    1.71     .98     .50
14               .73     .95    1.69    1.52     .68     .81
15               .93    1.64    1.07     .96     .65     .98
16               .72     .92     .63     .85    1.64    1.70
17+              .62     .88     .93     .52    1.71    2.24




H. Mosteller's Adjustment to Uniform Marginal Sums

12              2.16     .79     .92     .82     .75     .57    6.00
13              1.19     .93     .92    1.59     .93     .44    6.00
14               .66     .93    1.63    1.42     .64     .72    6.00
15               .86    1.64    1.06     .92     .63     .89    6.00
16               .65     .90     .61     .80    1.54    1.51    6.00
17+              .52     .81     .85     .45    1.51    1.86    6.00
Total           6.00    6.00    6.00    6.00    6.00    6.00   36.00
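
An adjustment to uniform marginal sums of the kind shown in Panel H can be obtained by alternately rescaling rows and columns until every row and column adds to the same value (iterative proportional fitting). The Python sketch below applies that procedure to the Panel A frequencies with a target sum of 6. It is an illustration under that assumption, not a record of how the published figures were computed, and small differences in the second decimal place are to be expected.

```python
# Panel A of Table 20: respondent's education (rows) by sibling's education (columns).
observed = [
    [881, 51, 53, 27, 109, 58],
    [ 65,  8,  7,  7,  18,  6],
    [ 40,  9, 14,  7,  14, 11],
    [ 23,  7,  4,  2,   6,  6],
    [ 91, 20, 12,  9,  77, 53],
    [ 57, 14, 13,  4,  59, 51],
]
n = len(observed)
target = 6.0  # uniform row and column sum, as in Panel H

adjusted = [[float(value) for value in row] for row in observed]
for _ in range(1000):
    # Rescale each row to the target sum.
    for i in range(n):
        scale = target / sum(adjusted[i])
        adjusted[i] = [value * scale for value in adjusted[i]]
    # Rescale each column to the target sum.
    for j in range(n):
        scale = target / sum(adjusted[i][j] for i in range(n))
        for i in range(n):
            adjusted[i][j] *= scale
    # Stop once the row sums are also on target after the column step.
    if max(abs(sum(row) - target) for row in adjusted) < 1e-10:
        break

for row in adjusted:
    print("  ".join(f"{value:5.2f}" for value in row))
```

Because only row and column multipliers are applied, every odds ratio of the original table is preserved in the adjusted one.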


I. Parameter Labels Under Multiplicative Model

12                 1       2       2       2       2       2
13                 2       2       2       2       2       2
14                 2       2       1       2       2       2
15                 2       2       2       2       2       2
16                 2       2       2       2       1       1
17+                2       2       2       2       1       1




J. Relative Densities Under Multiplicative Model (G² = 22.59 with 24 df)

12              2.40     .75     .93     .77     .92     .75
13               .97     .64     .67    1.09     .81     .42
14               .80     .97    1.80    1.46     .86    1.04
15               .79    1.30     .89     .72     .64     .98
16               .87    1.02     .74     .89    2.26    2.38
17+              .72     .95    1.05     .53    2.29    3.03




Note: Parameter estimates under the model of Panel J are δ1 = 2.40 and δ2 = .84; δ1/δ2 = 2.85.































TABLE 21



Respondent's Broad Occupation Group by Friend's Occupation: Detroit Men
(Laumann's Data from Jackson)



Friend's                       Respondent's Occupation
Occupation                      1       2       3       4   Total

A. Frequencies

1. Upper White Collar         329     162      84      87     662
2. Lower White Collar         226     284     123     165     798
3. Upper Blue Collar           83     103     265     218     669
4. Lower Blue Collar           55      97     174     418     744
Total                         693     646     646     888    2873




B. Column Percentages

1. UWC                       47.5    25.1    13.0     9.8    23.0
2. LWC                       32.6    44.0    19.0    18.6    27.8
3. UBC                       12.0    15.9    41.0    24.5    23.3
4. LBC                        7.9    15.0    26.9    47.1    25.9
Total                       100.0   100.0   100.0   100.0   100.0


C. Observed/Expected Frequencies Under Independence (G² = 778.17 with 9 df)

1. UWC                       2.06    1.09     .56     .43
2. LWC                       1.17    1.58     .69     .67
3. UBC                        .51     .68    1.76    1.05
4. LBC                        .31     .58    1.04    1.82






D. Estimated Frequencies Under Quasi-Symmetry (G² = 2.25 with 3 df)

1. UWC                      329.0   168.3    83.7    81.0     662
2. LWC                      219.7   284.0   128.3   166.1     798
3. UBC                       83.3    97.8   265.0   223.0     669
4. LBC                       61.0    95.9   169.0   418.0     744
Total                         693     646     646     888    2873




E. Starting Values Under Quasi-Perfect Choice (Blocked Diagonal)

1. UWC                          0       1       1       1
2. LWC                          1       0       1       1
3. UBC                          1       1       0       1
4. LBC                          1       1       1       0






F. Observed/Expected Frequencies Under Quasi-Perfect Choice (G² = 188.92 with 2 df)

1. UWC                       3.57*   1.55     .81     .67
2. LWC                       1.47    1.61*    .77     .84
3. UBC                        .72     .72    2.05*   1.47
4. LBC                        .62     .82    1.53    3.22*

*Cell ignored in estimating the model.
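
The starting values of Panel E can be carried to fitted frequencies by iterative proportional fitting: cells that start at 0 stay blocked, and the remaining cells are rescaled until their row and column sums match the corresponding sums of the data. The Python sketch below is one way to do this. It assumes, because the Panel F ratios appear consistent with it, that the counts being fitted are the quasi-symmetry estimates of Panel D rather than the raw frequencies. Function and variable names are hypothetical.

```python
import math

# Panel D of Table 21: estimated frequencies under quasi-symmetry
# (rows: friend's occupation; columns: respondent's occupation).
counts = [
    [329.0, 168.3,  83.7,  81.0],
    [219.7, 284.0, 128.3, 166.1],
    [ 83.3,  97.8, 265.0, 223.0],
    [ 61.0,  95.9, 169.0, 418.0],
]
n = len(counts)

# Panel E: 0 blocks a cell (here the diagonal), 1 leaves it free.
start = [[0.0 if i == j else 1.0 for j in range(n)] for i in range(n)]


def iterative_proportional_fit(counts, start, tol=1e-9, max_iter=500):
    """Rescale `start` until its margins match the margins of the free cells."""
    size = len(counts)
    free = [[start[i][j] > 0 for j in range(size)] for i in range(size)]
    row_target = [sum(counts[i][j] for j in range(size) if free[i][j])
                  for i in range(size)]
    col_target = [sum(counts[i][j] for i in range(size) if free[i][j])
                  for j in range(size)]
    fit = [row[:] for row in start]
    for _ in range(max_iter):
        for i in range(size):  # match row sums
            scale = row_target[i] / sum(fit[i])
            fit[i] = [value * scale for value in fit[i]]
        for j in range(size):  # match column sums
            scale = col_target[j] / sum(fit[i][j] for i in range(size))
            for i in range(size):
                fit[i][j] *= scale
        if max(abs(sum(fit[i]) - row_target[i]) for i in range(size)) < tol:
            break
    return fit


fit = iterative_proportional_fit(counts, start)

# Extend the fit to the blocked diagonal. Off the diagonal m[i][j] = a[i] * b[j],
# so a[i] * b[i] = m[i][k] * m[l][i] / m[l][k] for distinct i, k, l.
for i in range(n):
    k, l = [x for x in range(n) if x != i][:2]
    fit[i][i] = fit[i][k] * fit[l][i] / fit[l][k]

ratios = [[counts[i][j] / fit[i][j] for j in range(n)] for i in range(n)]

# Likelihood-ratio statistic over the cells that were actually fitted.
g2 = 2 * sum(counts[i][j] * math.log(ratios[i][j])
             for i in range(n) for j in range(n) if i != j)

for row in ratios:
    print("  ".join(f"{value:5.2f}" for value in row))
print(f"G2 = {g2:.2f}")
```

The diagonal ratios are the starred entries of Panel F: they show how far each blocked cell exceeds what the off-diagonal pattern alone would predict.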
TABLE 21 (continued)



Friend's                       Respondent's Occupation
Occupation                      1       2       3       4   Total

G. Parameter Labels Under Multiplicative Model

1. UWC                          5       2       1       1
2. LWC                          2       2       1       1
3. UBC                          1       1       3       2
4. LBC                          1       1       2       4

H. Relative Densities Under Multiplicative Model (G² = 6.46 with 5 df)

1. UWC                       3.27    1.22     .67     .60
2. LWC                       1.29    1.23     .57     .66
3. UBC                        .66     .62    1.71    1.21
4. LBC                        .50     .67    1.29    2.66

I. Starting Values Under Model of Linear Density Gradient (G² = 7.63
with 9 df)

1. UWC                          5       2       1       1
2. LWC                          2       2       1       1
3. UBC                          1       1       3       2
4. LBC                          1       1       2       4



Note: Parameter estimates under the model of Panel H are δ1 = .622,
δ2 = 1.245, δ3 = 1.712, δ4 = 2.664, δ5 = 3.268.



TABLE 22



Tests for Change in Mobility from Father's (or Other Family Head's) Occupation
to Son's First Full-time Civilian Occupation, U.S. Men Aged 20-64 in 1973
by 5-Year Age Cohorts



                         Model                              Change
Level         (PA)(WA)(H)     (PA)(WA)(HA)     Unblocked levels     Blocked level
blanked out     G²     df       G²     df         G²     df         G²     df      α

None          235.3   140     175.6   108        59.7    32
1             229.2   132     175.6   108        53.6    24        6.10     8    >.50
2             216.6   132     175.6   108        41.0    24       18.7      8    <.025
3             233.3   132     175.6   108        57.7    24        2.0      8    >.50
4             165.0    96     140.8    72        24.2    24       35.5*     8    <.001
5              11.9    24       0.0     0        11.9    24       47.8*     8    <.001



NOTES: P = father's occupation, W = son's first occupation, A = age,
H = design matrix from Table 4.



*Significant at the α = .05 level in a simultaneous test;
critical value = 20.09, given the 5 tests reported here.


FIGURE 1 

Volume of Mobility from Father's (or Other
Family Head's) Occupation to Son's First
Full-time Civilian Occupation, U.S. Men
Aged 20-64 in March 1973
FIGURE 2



Schematic Arrangement of Likelihood-Ratio Test Statistics (G²)
for Change in Interaction Parameters and Lack of Fit of
Multiplicative Models



[Schematic not reproduced. Surviving labels: interaction parameters; kth level; variable; the model; lack of fit at ...; ... of fit at all but ...]
FIGURE 3



Test Statistics for Selected Hypotheses About Interaction in Two or
More Classifications



                                       Test                  Degrees of
Null hypothesis                        statistic             freedom

1. No change in interaction            G²(a) - G²(b)         (L-1)(K-1)
   parameters at any level
   of the design

2. No change in interaction            G²(c) - G²(d)         (L-1)(K-2)
   parameters at any but the
   kth level of the design

3. No change or lack of fit            G²(a) - G²(c)         [illegible]
   at the kth level of the
   design

4. No lack of fit at the kth           [illegible]           [illegible]
   level of the design

5. No change in interaction            [illegible]           L-1
   parameters at the kth
   level of the design



NOTE: See Figure 2 for definitions of the G² statistics.
L = number of cross-classifications compared,
K = number of levels of the design matrix,
[symbol illegible] = number of independently variable cells
at the kth level of the design matrix.






